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NEUROTRANSMISSION-ASSOCIATED PROTEINS 

TECHNICAL FIELD 

The invention relates to novel nucleic acids, neurotxansmission-associated proteins encoded by 
5 these nucleic acids, and to the use of these nucleic acids and proteins in the diagnosis, treatment, and 
prevention of autoimmune/inflammatory, cardiovascular, neurological, developmental, cell proliferative, 
transport, psychiatric, metabolic, and endocrine disorders. The invention also relates to the assessment 
of the effects of exogenous compounds on the expression of nucleic acids and 
neurotxansmission-associated proteins. 

10 

BACKGROUND OF THE INVENTION 

The human nervous system, which regulates all bodily functions, is composed of the central 
■ nervous system (CNS), consisting of the brain and spinal cord, and the peripheral nervous system 

(PNS), consisting of afferent neural pathways for conducting nerve impulses from sensory organs to 

15 the CNS, and efferent neural pathways for conducting motor impulses from the CNS to effector 
organs. The PNS can be further divided into the somatic nervous system, which regulates voluntary 
motor activity such as for skeletal muscle, and the autonomic nervous system, which regulates 
involuntary motor activity for internal organs such as the heart, lungs, and viscera. CNS-associated 
proteins function in neuronal signaling, cell adhesion, nerve regeneration, axon guidance, neurogenesis, 

20 and other processes. 

The cerebral cortex or higher brain is the largest structure, consisting of a right and a left 
hemisphere interconnected by the corpus callosum. The cerebral cortex is involved in sensory, motor, 
and integrative functions related to perception, voluntary musculoskeletal movements, and the broad 
range of activities associated with consciousness, language, emotions, and memory. The cerebrum 

25 functions in association with the lower centers of the nervous system. The lower areas of the brain 
such as the medulla, pons, mesencephalon, cerebellum, basal ganglia, substantia nigra, hypothalamus, 
and thalamus control unconscious activities including arterial pressure and respiration, equilibrium, and 
feeding reflexes, such as salivation. 

The central nervous system (CNS) is composed of more than 100 billion neurons at the spinal 

30 cord level, the lower brain level, and the higher brain or cortical level. Neurons transmit electric or 
chemical signals between cells. The spinal cord, a thin, tubular extension of the central nervous 
system within the bony spinal canal, contains ascending sensory and descending motor pathways, and 
is covered by membranes continuous with those of the brainstem and cerebral hemispheres. The 
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spinal cord contains almost the entire motor output and sensory input systems of the trunk and limbs, 
and neuronal circuits in the cord also control rhythmic movements, such as walking, and a variety of 
reflexes. The lower areas of the brain such as the medulla, pons, mesencephalon, cerebellum, basal 
ganglia, substantia nigra, hypothalamus, and thalamus control unconscious activities including arterial 
5 pressure and respiration, equilibrium, and feeding reflexes, such as salivation. Emotions, such as 
anger, excitement, sexual response, and reaction to pain or pleasure, originate in the lower brain. The 
cerebral cortex or higher brain is the largest structure, consisting of a right and a left hemisphere 
interconnected by the corpus callosum. The cerebral cortex is involved in sensory, motor, and 
integrative functions related to perception, voluntary musculoskeletal movements, and the broad range 
10 of activities associated with consciousness, language, emotions, and memory. The cerebrum functions 
in association with the lower centers of the nervous system. 
Nervous system organization and development 

A nerve cell (neuron) contains four regions, the cell body, axon, dendrites, and axon terminal. 
The cell body contains the nucleus and other organelles. The dendrites are processes which extend 
15 outward from the cell body and receive signals from sense organs or from the axons of other neurons. 
These signals are converted to electrical impulses and transmitted to the cell body. The axon, whose 
size can range from one millimeter to more than one meter, is a single process that conducts the nerve 
impulse away from the cell body. Cytoskeletal fibers, including microtubules and neurofilaments, run 
the length of the axon and function in transporting proteins, membrane vesicles, and other. 
20 macromolecules from the cell body along the axon to the axon terminal. Some axons are surrounded 
by a myelin sheath made up of membranes from either an oligodendrocyte cell (CNS) or a Schwann 
cell (PNS). Myelinated axons conduct electrical impulses faster than unmyelinated ones of the same 
diameter. The axon terminal is at the tip of the axon away from the cell body. (See Lodish, H. et aL 
(1986) Molecular Cell Biology Scientific American Books New York NY, pp. 715-719.) 
25 CNS-associated proteins have roles in neuronal signaling, cell adhesion, nerve regeneration, 

axon guidance, neurogenesis, and other functions. Certain CNS-associated proteins form an integral 
part of a membrane or are attached to a membrane. For example, neural membrane protein 35 
(NMP35) is closely associated with neuronal membranes and is known to be highly expressed in the 
rat adult nervous system (Schweitzer, B. et aL (1998) MoL CelL NeuroscL 11:260-273). 
30 Synaptophysin (SY) is a major integral membrane protein of small synaptic vesicles. The 

chromosomal location of SY in human and mouse is on the X chromosome in subbands Xpll.22- 
p 11.23. This region has been implicated in several inherited diseases including Wiskott-Aldrich 
syndrome, three forms of X-linked hypercalciuric nephrolithiaisis, and the eye disorders retinitis 
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pigmentosa 2, congenital stationary night blindness, and Aland Island eye disease (Fisher, S.E. et al. 
(1997) Genomics 45:340-347). Peripherin, or retinal degeneration slow protein (rds), is an integral 
membrane glycoprotein that is present in the rims of photoreceptor outer segment disks. In mammals, 
rds is thought to stabilize the disk rim through heterophilic interactions with related nonglycosylated 
5 proteins. Rds is a mouse neurological mutation that is characterized by abnormal development of rod 
and cone photoreceptors followed by their slow degeneration (Kedzierski, W.J. et al. (1999) 
Neurochem. 72:430-438). 

Each of over a trillion neurons in adult humans connects with over a thousand target cells 
(Tessier-Lavigne, M. et al. (1996) Science 274:1123-1133). These neuronal connections form during 
10 embryonic development. Each differentiating neuron sends out an axon tipped at the leading edge by 
a growth cone. Aided by molecular guidance cues, the growth cone migrates through the embryonic 
environment to its synaptic target. Progressive axon outgrowth occurs during neural development but 
not in the mature mammalian CNS. Following CNS injury, expression of growth-inhibiting molecules 
is enhanced while availability of their growth-promoting counterparts diminishes. Proteins governing 
15 developmental axon guidance contribute to the failure of injured central neurons to regenerate. These 
proteins include Semaphorin3A and the Semaphorin3A receptor proteins neuropilin- 1 and plexin-Al 
(Pasterkamp, R.J. and J. Verhaagen (2001) Brain Res. Brain Res. Rev. 35:36-54). 

Semaphorins function during embryogenesis by providing local signals to specify territories 
inaccessible to growing axons (Puschel, A.W. et al. (1995) Neuron 14:941-948). They consist of at f 
20 least 30 different members and are found in vertebrates, invertebrates, and even certain viruses. All 
semaphorins contain the sema domain which is approximately 500 amino acids in length. Neuropilin, a 
semaphorin receptor, has been shown to promote neurite outgrowth in vitro. The extracellular region 
of neuropilins consists of three different domains: CUB, discoidin, and MAM domains. The CUB and 
the MAM motifs of neuropilin have been suggested to have roles in protein-protein interactions and 
25 are thought to be involved in the binding of semaphorins through the sema and the C-terminal domains 
(reviewed in Raper, J. A. (2000) Curr. Opin. Neurobiol. 10:88-94). 

The guidance of axons during development involves both positive and negative effects (i.e., 
chemoattraction and chemorepulsion). The Slit family of proteins have been implicated in promoting 
axon branching, elongation, and repulsion. Members of the Slit family have been identified in a variety 
30 of organisms, including insects, amphibians, birds, rodents and humans (Guthrie, S. (1999) Current 
Biology 9:R432-R435). Slit proteins are ligands for the repulsive guidance receptor, Roundabout 
(Robo); however, Slit proteins also cause elongation in some assays. A post-translationally processed 
form of Slit appears to be the active form of the protein (Guthrie, S. supra; and Brose, K. et al. (1999) 
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Cell 96:795-806). 

Axon growth is also guided in part by contact-mediated mechanisms involving cell surface and 
extracellular matrix (ECM) molecules. Many ECM molecules, including fibronectin, vitronectin, 
members of the laminin, tenascin, collagen, and thrombospondin families, and a variety of 

5 proteoglycans, can act either as promoters or inhibitors of neurite outgrowth and extension (Tessier- 
Lavigne et al, supra). Receptors for ECM molecules include integrins, immunoglobulin superfamily 
members, and proteoglycans. ECM molecules and their receptors have also been implicated in the 
adhesion, maintenance, and differentiation of neurons (Reichardt, L.R et al. (1991) Ann. Rev. 
Neurosci. 14:531-571). The proteoglycan testican is localized to the post-synaptic area of pyramidal 

10 cells of the hippocampus and may play roles in receptor activity, neuromodulation, synaptic plasticity, 
and neurotransmission (Bonnet, R et al. (1996) J. Biol. Chem. 271:4373-4380). 

Other proteins involved in morphogenetic processes in neural tissues include members of the 
striatin family such as zinedin and striatin, and members of the sialoadhesins such as myelin-associated 
glycoprotein (MAG), Schwann cell myelin protein (SMP) and sialoadhesin. Zinedin and striatin are 

15 calmodulin-binding, WD repeat proteins primarily expressed in the brain (Castets, F. et al. (2000) J. 
Biol. Chem. 275:19970-19977). These two proteins share four homologous stretches of amino acids 
that have been determined to be involved in protein-protein interactions such as caveolin-binding and 
calmodulin-binding. Both zinedin and striatin have been shown to bind calmodulin in a Ca 2+ -dependent 
manner. That striatin is possibly involved in the formation of dendrites has been demonstrated in 

20 cultured embryonic motoneurons in which striatin synthesis was specifically blocked. These cells had 
a reduced number of poorly branched dendrites, whereas the growth of axons appeared normal 
(Bartoli, M. et al. (1999) J. NeurobioL 40:234-243). The role of striatin in the control of locomotion 
has been demonstrated in rats (Bartoli, M. et al. supra ). The sialoadhesins are a major subgroup of 
the I-type lectins which are themselves a subfamily of the lectins. I-type lectins consist mainly of 

25 transmembranous glycoproteins belonging to the immunoglobulin superfamily. The sialoadhesins, 
myelin-associated glycoprotein (MAG), Schwann cell myelin protein (SMP) and sialoadhesin, are 
expressed in the nervous system and are generally characterized as sialic acid-binding adhesion 
molecules. MAG is expressed by oligodendrocytes in the CNS in L (large) and S (small) forms that 
are developmental!/ regulated such that the L form is expressed predominantly during early stages of 

30 myelination and the S form is primarily expressed in adulthood (Tropak, M.B. et al. (1988) MoL Brain 
Res. 4:143-155; Pedraza, L. et al. (1991) J. Neurosci. Res. 29:141-148). MAG is also expressed in 
the Schwann cells in the peripheral nervous system (PNS) and has been implicated in cell recognition 
mechanisms underlying either the promotion or inhibition of cell adhesion and neurite outgrowth 
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(Martini, R. (1994) J. Neurocyte! 21:1-28; Bartsch, U. (1996) J. Neurocyte! 25:303-313). SMP is 
expressed by oligodendrocytes and in Schwann cells, has similar sialic acid-binding properties to MAG 
and shares 43.5% sequence identity with MAG (Dulac, C. et al (1992) Neuron 8:323-334; Collins, B. 
E. et al. (1997) J. Biol. Chem. 272:16889-16895). Sialoadhesin is a membrane glycoprotein originally 

5 described as a cell surface receptor expressed by murine macrophages (Crocker, P.R. and Gordon, S. 
(1986) J. Exp. Med. 164:1862-1875; Crocker, P.R. and Gordon, S. (1989) J. Exp. Med. 169:1333- 
1346). Expression of sialoadhesin has since been demonstrated in rat macrophages which have been 
induced by glucocorticoids and can be further enhanced by the addition of interferon P or y, interleukin 
4 or lipopolysaccharides (Van den Berg T.K. et al. (1996) J. Immunol. 157:3130-3138). Expression 

10 patterns and binding specificities of sialoadhesin suggest a possible role in regulation of myeloid cell 
development (Crocker, P.R. et al. (1990) Blood 76:1131-1138; Crocker P.R. et al. (1995) J. Clin. 
Invest 95:635-643). 

Neurotrophic regulate development, maintenance, and function of vertebrate nervous 
systems. Neurotrophics activate two different classes of receptors, the Trk family of receptor 

15 tyrosine kinases and p75NTR, a member of the TNF receptor superfamily. Through these receptors, 
neurotrophins activate many signaling pathways, including those mediated by ras and members of the 
cdc-42/ras/rho G protein families, and by the MAP kinase, PI-3 kinase, and Jun kinase cascades. 
During development, limiting amounts of neurotrophins function as survival factors to ensure a match 
between the number of surviving neurons and the requirement for appropriate target innervation. 

20 They also regulate cell fate decisions, axon growth, dendrite pruning, the patterning of innervation, and 
the expression of proteins crucial for normal neuronal function, such as neurotransmitters and ion 
channels. These proteins also regulate many aspects of neural function. In the mature nervous 
system, they control synaptic function and synaptic plasticity, while continuing to modulate neuronal 
survival (Huang, E.J. and L.F. Reichardt (2001) Ann. Rev. Neurosci. 24:677-736). Neuritin is a 

25 protein induced by neural activity and by neurotrophins which promote neuritogenesis. 

The neurexophilins are neuropeptide-like proteins which are proteolytically processed after 
synthesis. They are ligands for the neuron-specific cell surface proteins, the a-neurexins. 
Neurexophilins and neurexins may participate in a neuron signaling pathway (Missler, M. and T.C. 
Sudhof (1998) J. Neurosci. 18:3630-3638; Missler, M. et al. (1998) J. Biol. Chem. 273:34716-34723). 

30 Ninjurin is a neuron cell surface protein which plays a role in cell adhesion and in nerve regeneration 
following injury. Ninjurin is up-regulated after nerve injury in dorsal root ganglion neurons and in 
Schwann cells (Araki, T. and J. Milbrandt (1996) Neuron 17:353-361). Ninjurin2 is expressed in 
mature sensory and enteric neurons and promotes neurite outgrowth. Ninjurin2 is upregulated in 
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Schwann cells surrounding the distal segment of injured nerve with a time course similar to that of 
ninjurinl, neural CAM, and LI (Araki, T. and J. Milbrandt (2000) J. Neurosci. 20:187-195). 

Neurexin IV is essential for axonal insulation in the PNS in embryos and larvae. Axonal 
insulation is of key importance for the proper propagation of action potentials. Caspr, a vertebrate 
5 homolog of Neurexin IV — also named paranodin - is found in septate-like junctional structures 
localized to the paranodal region of the nodes of Ranvier, between axons and Schwann cells. 
Caspr/paranodin is implicated in blood-brain barrier formation, and linkage of neuronal membrane 
components with the axonal cytoskeletal network (Bellen, HJ. et al. (1998) Trends Neurosci. 
21:444-449). 

10 Mammalian Numb is a phosphotyrosine-binding (PTB) domain-containing protein which may 

be involved in cortical neurogenesis and cell fate decisions in the mammalian nervous system. 
Numb's binding partner, the LNX protein, contains four PDZ domains and a ring finger domain and 
„ may participate in a signaling pathway involving Numb. PDZ domains have been found in proteins 
which act as adaptors in the assembly of multifunctional protein complexes involved in signaling events 

15 at surfaces of cell membranes (Ponting, CP. (1997) Bioessays 19:469-479). LNX contains a tyrosine 
phosphorylation site which may be important for the binding of other PTE-containing proteins such as 
SHC, an adaptor protein which associates with tyrosine-phosphorylated growth factor receptors and 
downstream effectors (Dho, S.E. et al. (1998) J. Biol. Chem. 273:9179-9187). 

Nogo has been identified as a component of the central nervous system myelin that prevents . 

20 axonal regeneration in adult vertebrates. Cleavage of the Nogo-66 receptor and other , 
glycophosphatidylinositol-linked proteins from axonal surfaces renders neurons insensitive to Nogo-66, 
facilitating potential recovery from CNS damage (Fournier, A.E. et al. (2001) Nature 409:341-346). 

Homeobox transcription factors direct nerve-cell associated tissue patterning and 
differentiation. The presence and function of these proteins appears to be ubiquitous in nematodes, 

25 arthropods, and vertebrates. One example of these proteins is DRG11 , a homeobox transcription 
factor expressed in mammalian sensory neurons, and which appears to be involved in neural crest 
development (Saito, T. et al. (1995) MoL Cell Neurosci. 6:280-292). Cutaneous sensory neurons that 
detect noxious stimuli project to the dorsal horn of the spinal cord, while those innervating muscle 
stretch receptors project to the ventral horn. DRG1 1 is required for the formation of spatio-temporally 

30 appropriate projections from nociceptive sensory neurons to their central targets in the dorsal horn of 
the spinal cord (Chen, Z.F. et al. (2001) Neuron 31:59-73). 
Synapses 

Contact between one neuron and another occurs at a specialized site called the synapse. 
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Many nervous system functions are regulated by diverse synaptic proteins such as synaptophysin, the 
synapsins, growth associated protein 43 (GAP-43), SV-2, and p65, which are distributed in subcellular 
compartments of the synapse. Synaptic terminals also contain many other proteins involved in calcium 
transport, neurotransmission, signaling, growth, and plasticity. At this site, the axon terminal from one 

5 neuron (the presynaptic cell) sends a signal to. another neuron (the postsynaptic cell). Synapses may 
be connected either electrically or chemically. An electrical synapse consists of gap junctions 
connecting the two neurons, allowing electrical impulses to pass directly from the presynaptic to the 
postsynaptic cell. In a chemical synapse, the axon terminal of the presynaptic cell contains membrane 
vesicles containing a particular neurotransmitter molecule. A change in electrical potential at the 

10 nerve terminal results in the influx of calcium ions through voltage-gated channels which triggers the 
release of the neurotransmitter from the synaptic vesicle by exocytosis. The neurotransmitter rapidly 
diffuses across the synaptic cleft separating the presynaptic nerve cell from the postsynaptic cell. The 
neurotransmitter then binds receptors and opens transmitter-gated ion channels located in the plasma 
membrane of the postsynaptic cell, provoking a change in the cell's electrical potential. This change in 

15 membrane potential of the postsynaptic cell may serve either to excite or inhibit further transmission of 
the nerve impulse. 

Presynaptic calcium channel activity is modulated by cysteine-string proteins (CSPs). CSPs 
are secretory vesicle proteins that function in neurotransmission as well as in exocytosis in other cell- 
types. CSPs belong to the DnaJ/hsp40 (heat shock protein) chaperone family. The effect of CSPs on 

20 calcium levels is likely to be downstream of calcium release and is likely to involve exocytosis, possibly 
in connection with G-proteins (Braun, J.E. et al. (1995) Neuropharmacology 34:1361-9136; Magga, 
J.M. et al. (2000) Neuron 28:195-204; Dawson-Scully, K. et al. (2000) J. Neurosci. 20:6039-6047; and 
Chamberlain, L.BL et al. (2001) J. Cell Sci. 114:445-455). Neuregulins (NRGs) mediate between the 
electrical neural activity and molecular components by regulating the expression of ion channel 

25 receptors or transmitter release in synapses. NRGs may also be signaling factors involved in tuning 
locomotion or other higher functions by coordinating excitatory and inhibitory neurons (Ozaki, M. 
(2001) Neuroscientist 7:146-154). 

N- and P/Q-type Ca 2+ channels are localized in high density in presynaptic nerve terminals 
and are crucial elements in neuronal excitation-secretion coupling. In addition to mediating Ca 2+ entry 

30 to initiate transmitter release, they are thought to interact directly with proteins of the synaptic vesicle 
docking/fusion machinery. N-type and P/Q-type Ca 2 * channels are colocalized with syntaxin in 
high-density clusters in nerve terminals. The synaptic protein interaction (synprint) sites in the 
intracellular loop II-HI (LH-IH) of both alpha IB and alpha 1A subunits of N-type and P/Q-type Ca 2+ 
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channels bind to syntaxin, SNAP-25, and synaptotagmin. Presynaptic Ca 2+ channels not only provide 
the Ca 2+ signal required by the exocytotic machinery, but also contain structural elements that are 
integral to vesicle docking, priming, and fusion processes (Catterall, W.A. (1999) Ann. NY Acad. Sci. 
868:144-159). Synaptotagmins are a large family of proteins involved in both regulated and 

5 constitutive vesicular trafficking. They include a neuronal type (synaptotagmin I-V, X, and XI) and a 
ubiquitous type (synaptotagmin VI-IX). Ca 2+ -dependent synaptotagmin activation is involved in neurite 
outgrowth (Mikoshiba, K. et aL (1999) Chem. Fhys. Lipids 98:59-67). 

Proteins associated with the membranes of synaptic vesicles include vamp (synaptobrevin), 
rab3A, synaptophysin, synaptotagmin (p65) and SV2. These membrane proteins function in regulated 

10 exocytosis by regulating neurotransmitter uptake, vesicle targeting, and fusion with the presynaptic 
plasma membrane (Elferink, L.A. and R.BL Scheller (1993) J. Cell Sci. Suppl. 17:75-79). 

Physophilin, also known as the Ac39 subunit of the V-ATPase, is an oligomeric protein that 
binds the synaptic vesicle protein synaptophysin, constituting a complex that may form the exocytotic 
fusion pore. Ac39 is present in a synaptosomal complex which, in addition to synaptophysin, includes 

15 the bulk of synaptobrevin n, and subunits c and Acl 15 of the V0 sector of the V-ATPase. In situ 
hybridization in rat brain reveals a largely neuronal distribution of Ac39/physophilin mRNA which 
correlates spatio-temporaHy with those of subunit c and synaptophysin. Immunohistochemical analysis 
shows that Ac39/physophilin is mostly concentrated in the neuropil with a pattern identical to subunit A - 
and very similar to synaptophysin. Double-labeling immunofluorescence shows a complete 

20 colocalization of Ac39/physophilin with subunit A and a partial colocalization with synaptophysin in the 
neuropil (Carrion- Vazquez M. et al. (1998) Eur. J. Neurosci.l0:1153-1166). 

The plasma membrane dopamine transporter (DAT) is essential for the reuptake of released 
dopamine from the synapse. Uptake of dopamine is temperature- and time-dependent, and is inhibited 
by a variety of compounds, such as cocaine. DAT-knockout mice have been shown to exhibit 

25 extreme hyperactivity and resistance to both cocaine and amphetamine, consistent with the primary 

action of cocaine on DAT (Giros, B. et aL (1996) Nature 379:606-612). The perturbation of the tightly 
regulated DAT also predisposes neurons to damage by a variety of insults. Most notable is the 
selective degeneration of DAT-expressing dopamine nerve terminals in the striatum thought to 
underlie Parkinson's disease. DAT expression can predict the selective vulnerability of neuronal 

30 populations, which suggests that therapeutic strategies aimed at altering DAT function could have 
significant benefits in a variety of disorders (Gary, W.M. et al. (1999) Trends Pharmacol. Sci. 
20:424-429). 

43 KD postsynaptic protein or acetylcholine receptor-associated 43 KD protein (RAPSYN) is 
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thought to play a role in anchoring or stabilizing the nicotinic acetylcholine receptor at synaptic sites. 
RAPS YN is involved in membrane association and may link the nicotinic acetylcholine receptor to the 
underlying postsynaptic cytoskeleton (Buckel, A. et al. (1996) Genomics 35:613-616). Neuritin is a 
protein whose gene is known to be induced by neural activity and by neurotrophic which promote 

5 neurogenesis. Neuraxin is a structural protein of the rat central nervous system that is believed to be 
immunologically related to microtubule-associated protein 5 (MAP5). Neuraxin is a novel type of 
neuron-specific protein which is characterized by an unusual amino acid composition, 12 central 
heptadecarepeats and putative protein and membrane interaction sites. The gene encoding neuraxin is 
unique in the haploid rat genome and is conserved in higher vertebrates. Neuraxin is implicated in 

10 neuronal membrane-microtubule interactions and is expressed throughout the rodent CNS (Rienitz, A. 
et aL (1989) EMBO J. 8:2879-2888). 

Synaptic nuclear envelope-1 (Syne-1) protein is selectively associated with synaptic nuclei and 
may be involved in the formation or maintenance of nuclear aggregates at the neuromuscular junction. 
Syne-1 contains multiple spectrin repeats similar to those found in dystrophin and utrophin, as well as a 

15 domain homologous to the carboxyl-terminal of Klarsicht, a protein associated with nuclei and required 
for a subset of nuclear migrations in Drosophila (Apel, E. D. et al. (2000) J. Biol. Chem. 
275:31986-31995.). 

Clq-Related Factor (CRF) is found to be expressed at highest levels in the brain, particularly 
in the brainstem. In situ hybridization to mouse brain sections demonstrated that CRF transcripts are 

20 most abundant in areas of the nervous system involved in motor function, such as the Purkinje cells of 
the cerebellum, the accessory olivary nucleus, the pons and the red nucleus. CRF is predicted to 
encode a polypeptide with a hydrophobic signal sequence, a collagenous region, and a globular domain 
at the carboxy terminus that shares homology to the Clq signature domain, a subunit of the CI 
enzyme complex that activates the serum complement system (Berube, N. G. et al. (1999) Brain Res. 

25 Mol. Brain Res. 63:233-240). 

Neurotransmitters and neurotransmitter transport proteins 

Contact from one neuron to another occurs at a specialized site called the synapse. At this 
site, the axon terminal from one neuron (the presynaptic cell) sends a signal to another neuron (the 
postsynaptic cell). Synapses may be connected either electrically or chemically. Anelectrical 

30 synapse consists of gap junctions connecting the two neurons, allowing electrical impulses to pass 
directly from the presynaptic to the postsynaptic cell. In a chemical synapse, the axon terminal of the 
presynaptic cell contains membrane vesicles containing a particular neurotransmitter molecule. A 
change in electrical potential at the nerve terminal results in the influx of calcium ions through voltage- 
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gated channels which triggers the release of the neurotransmitter from the synaptic vesicle by 
exocytosis. The neurotransmitter rapidly diffuses across the synaptic cleft separating the presynaptic 
nerve cell from the postsynaptic cell. The neurotransmitter then binds receptors and opens 
transmitter-gated ion channels located in the plasma membrane of the postsynaptic cell, provoking a 
5 change in the cell' s electrical potential. This change in membrane potential of the postsynaptic cell 
may serve either to excite or inhibit further transmission of the nerve impulse. Presynaptic calcium 
channel activity is modulated by cysteine-string proteins (CSPs). CSPs are secretory vesicle proteins 
that function in neurotransmission as well as in exocytosis in other cell-types. CSPs belong to the 
DnaJ/hsp40 (heat shock protein) chaperone family. The effect of CSPs on calcium levels is likely to 

10 be downstream of calcium release and is likely to involve exocytosis, possibly in connection with G- 
proteins (Braun, J.E. et al. (1995) Neuropharmacology 34:1361-9136; Magga, J.M. et al. (2000) 
Neuron 28:195-204; Dawson-Scully, K. et al. (2000) J. Neurosci. 20:6039-6047; and Chamberlain, 
L.H. et al. (2001) J. Cell Sci. 114:445-455). 

Neurotransmitters comprise a diverse group of some 30 small molecules which include 

15 acetylcholine, monoamines such as serotonin, dopamine, and histamine, and amino acids such as 
gamma-aminobutyric acid (GAB A), glutamate, and aspartate, and neuropeptides such as endorphins 
and enkephalins (McCance, K.L. and S.E. Huether (1994) PATHOPHYSIOLOGY, The Biologic 
Basis for Disease in Adults and Children . 2nd edition, Mosby, St. Louis, MO, pp. 403-404). Many of 
these molecules have more than one function and the effects may be excitatory, e.g. to. depolarize the 

20 postsynaptic cell plasma membrane and stimulate nerve impulse transmission, or inhibitory, e.g. to 
hyperpolarize the plasma membrane and inhibit nerve impulse transmission. 

Neurotransmitters and their receptors are targets of pharmacological agents aimed at 
controlling neurological function. For example, GABA is the major inhibitory neurotransmitter in the 
CNS, and GABA receptors are the principal target of sedatives such as benzodiazepines and 

25 barbiturates which act by enhancing GABA-mediated effects (Katzung, B.G. (1995) Basic and 
Clinical Pharmacology . 6th edition, Appleton & Lange, Norwalk, CT, pp. 338-339). 

Two major classes of neurotransmitter transporters are essential to the function of the . 
nervous system. The first class is uptake carriers in the plasma membrane of neurons and glial cells, 
which pump neurotransmitters from the extracellular space into the cell. This process relies on the 

30 Na + gradient across the plasma membrane, particularly the co-transport of Na + . Two families of 
proteins have been identified. One family includes the transporters for GABA, monoamines such as 
noradrenaline, dopamine, and serotonin, and amino acids such as glycine and proline. Common 
structural components include twelve putative transmembrane a-helical domains, cytoplasmic N- and 
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C- termini, and a large glycosylated extracellular loop separating transmembrane domains three and 
four. This family of homologous proteins derives their energy from the co-transport of Na + and CI' 
ions with the neurotransmitter into the cell (Na + /Cr neurotransmitter transporters). The second family 
includes transporters for excitatory amino acids such as glutamate. Common structural components 
5 include 6-10 putative transmembrane domains, cytoplasmic N- and C- tennini, and glycosylations in 
the extracellular loops. The excitatory amino acid transporters are not dependent on CI*, and may 
require intracellular K + ions (Na + /K + - neurotransmitter transporters) (Liu, Y. et al. (1999) Trends Cell 
Biol. 9:356-363). 

The second class of neurotransmitter transporters is present in the vesicle membrane, and 

10 concentrates neurotransmitters from the cytoplasm into the vesicle, before exocytosis of the vesicular 
contents during synaptic transmission. Vesicular transport uses the electrochemical gradient across 
the vesicular membrane generated by a H + -ATPase. Two families of proteins are involved in the 
transport of neurotransmitters into vesicles. One family uses primarily proton exchange to drive 
transport into secretory vesicles and includes the transporters for monoamines and acetylcholine. For 

15 example, the monoamine transporters exchange two luminal protons for each molecule of cytoplasmic 
transmitter. The second family includes the GABA transporter, which relies on the positive charge 
inside synaptic vesicles. The two classes of vesicular transporters show no sequence similarity to 
each other and have structures distinct from those of the plasma membrane carriers (Schloss, P. et al. 
(1994) Curr. Opin. Cell Biol. 6:595-599; Liu et al., supra). 

20 GABA is the predominant inhibitory neurotransmitter and is widely distributed in the 

mammalian nervous system. GABA is cleared from the synaptic cleft by specific, high-affroity, Na + - 
and CI"- dependent transporters, which are thought to be localized to both pre- and postsynaptic 
neurons, as well as to surrounding glial cells. At least four GABA transporters (GAT1-GAT4) have 
been cloned (Liu, Q.-R. et al. (1993) J. Biol. Chem. 268:2106-2112). Studies of [ 3 H]-GABA uptake 

25 into cultured cells and plasma-membrane vesicles isolated from various tissues revealed considerable 
differences in GABA transporter heterogeneity. GABA transporters exhibit differences in substrate 
affinity and specificity, distinct blocker pharmacologies, and different tissue localization. For example, 
the values of GABA uptake of the expressed GAT1 to GAT4 are 6, 79, 18, and 0.8 mM, 
respectively. In addition to transporting GABA, GAT2 also transports betaine; GAT3 and GAT4 also 

30 transport P-alanine and taurine. Pharmacological studies revealed that GABA transport by GAT1 and 
GAT4 is more sensitive to 2,4-diaminobutyric acid aud guavicine than that by GAT2 and GAT3. In 
situ hybridization showed that GAT1 and GAT4 expression is brain specific. GAT2 and GAT3 
mRNAs were detected in tissues such as liver and kidney (Schloss et al., supra; Borden, L.A. (1996) 
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Neurochem. Int. 29:335-356; Nelson, N. (1998) J. Neurochem. 71:1785-1803). 

Human studies indicated that GABA transporter function is reduced in epileptic hippocampi. 
Decreased GABAergic neurotransmission has also been implicated in the pathophysiology of 
schizophrenia (Simpson, M.D. et aL (1992) Psychiatry Res. 42:273-282). 
5 Diazepam binding inhibitor (DBI), also known as endozepine and acyl-Coenzyme (CoA)- 

binding protein, is an endogenous GABA receptor ligand which is thought to down-regulate the effects 
of GABA. DBI binds medium- and long-chain acyl-CoA esters with very high affinity and may 
function as an intracellular carrier of acyl-CoA esters (* 125950 Diazepam Binding Inhibitor; DBI, 
Online Mendelian Inheritance in Man (OMIM); PROSITE PDOC00686 Acyl-CoA-binding protein 
10 signature). 

Glycine serves as one of the major inhibitory neurotransmitters in the mammalian nervous 
system by activating chloride-channel receptors, which are members of a ligand-gated ion-channel 
superfamily (Betz, H. (1990) Neuron 5:383-392). Glycine also facilitates excitatory transmission 
through an aflosteric activation of the N-methyl-D-aspartate (NMD A) receptor (Johnson, J.W. and P. 

15 Ascher (1987) Nature 325:529-531). Forms of glycine transporter include GLYT 1 and GLYT 2. 
Variants of GLYT1 (GLYT1 a/b) are generated by alternative splicing (Liu, Q.-R. et al. (1993) J. 
Biol. Chem. 268:22802-22808). GLYTla is transcribed in both neural and non-neural tissues, whereas 
GLYTlb was detected only in neural tissues (Borowsky, B. et aL (1993) Neuron 10:851-863). High 
levels of GLYTla/b mRNA were found in hippocampus and cortex, implying its involvement in the . 

20 regulation of excitatory synaptic transmission. It is not clear whether GLYTla is expressed in 

neurons, in glia or in both. In contrast, GLYTlb is found almost exclusively in fiber tracts, suggesting 
its localization in glial cells (Schloss et al., supra), GLYT2 is expressed mainly in brainstem and spinal 
cord (Schloss et al., supra). 

The second identified glycine transporter (GLYT2) differs from GLYTla/b by its extended 

25 intracellular amino terminus. The predominant localization of its mRNA in brainstem and spinal cord 
and its insensitivity to N-methyl-aminoacetic acid suggests that GLYT2 terminates signal transduction 
at the strychnine-sensitive inhibitory glycine receptor. It has been proposed that, upon depolarization 
of cells harboring GLYTlb, the transporter runs backwards and releases glycine to act as a 
neuromodulator amino acid at the NMDA receptor (Attwell, D. and M. Bouvier (1992) Curr. Biol. 

30 2:541-543). Such a Ca 2+ -independent, non-vesicular release of neurotransmitters by reverse transport 
was demonstrated for glutamate and serotonin. This evidence suggests that the transmitter 
transporters may be important for both the initiation and termination of neurotransmitter action 
(Schloss et al., supra). 
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The plasma membrane dopamine transporter (DAT) is essential for the reuptake of released 
dopamine from the synapse. Uptake of dopamine is temperature- and time-dependent, and is inhibited 
by a variety of compounds, such as cocaine. DAT- knockout mice have been shown to exhibit 
extreme hyperactivity and resistance to both cocaine and amphetamine, consistent with the primary 

5 action of cocaine on DAT (Giros, B. et al. (1996) Nature 379:606-612). The perturbation of the tightly 
regulated DAT also predisposes neurons to damage by a variety of insults. Most notable is the 
selective degeneration of DAT-expressing dopamine nerve terminals in the striatum thought to 
underlie Parkinson's disease. DAT expression can predict the selective vulnerability of neuronal 
populations, which suggests that therapeutic strategies aimed at altering DAT function could have 

10 significant benefits in a variety of disorders (Gary, W.M. et al. (1999) Trends Pharmacol. Sci. 
20:424-429). 

Creatine transporters are strongly related to transporters for GAB A The primary sequence 
identity between creatine transporter species homologs is very high (98-99%). Pharmacological 
characterization demonstrated high affinity creatine uptake (27-43 mM), which was blocked by 

15 creatine analogs with high affinity. Creatine transporters are widely expressed in a variety of 

mammalian tissues, including brain, adrenal gland, intestine, colon, prostate, thymus, ovary, spleen, 
pancreas, placenta, umbilical cord, thyroid, tongue, pharynx, vertebral discs, jaw, and nasal epithelium. 
Genetic mapping in the mouse localizes the creatine transporter to a region on the X chromosome in 
linkage conservation with the human region Xq28, the location of the genes for several neuromuscular 

20 diseases (Nash, S.R. et al. (1994) Receptors Channels 2:165-174). 

The substrates of a number of cDNA clones encoding proteins of the Na + /Cl*-dependent 
transporter families are still not identified. These are orphan transporters. Identification of the 
substrates for orphan transporters has been difficult because in situ hybridization and 
immunohistochemistry indicate that the transporters are synthesized by phenotypicafly different 

25 neuronal populations, for example ghitaminergic, GABAergic, histaminergic, or serotoninergic neurons. 
One of the transporters, NTT4, exhibits the highest homology to the creatine transporter. It differs 
structurally from other members of this family in having an unusually long loop between 
transmembranes seven and eight (Liu, Q.-R. et al. (1993) EEBS Lett. 315:114-118; Schloss et al., 
supra). 

30 Ghitamate is a major excitatory neurotransmitter in the mammalian central nervous system. 

Electrogenic (Na + /K + )-coupled ghitamate transporters, located in the plasma membranes of nerve 
terminals and glial cells, mediate removal of ghitamate released at excitatory synapses and maintain 
extracellular concentrations below neurototoxic levels. Glutamate transporters achieve this process by 
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co-transport with three sodium ions and one proton, followed by translocation of a potassium ion in the 
opposite direction (Zerangue, N. and M.P. Kavanaugh (1996) Nature 383:634-637). 

Glutamate transporters belong to a large family of transport proteins. The membrane topology 
of the glutamate transporters reveals six membrane-spanning helices in the N-terminal part of the 

5 proteins (Slotboom, DJ. et al. (1999) Microbiol. Mol. Biol. Rev. 63:293-307). The C-terminal half of 
the glutamate transporters is well conserved and constitutes a major part of the translocation pathway 
and contains the binding sites for the substrate and co-transported ions (Zhang, Y. and B.I. Kanner 
(1999) Proc. Natl. Acad. Sci. USA 96:1710-1715). 

The membrane topology of the glutamate transporters reveals six membrane-spanning helices 

10 in the N-terminal part of the proteins (Slotboom, DJ. et al. (1999) Microbiol. Mol. Biol. Rev. 

63:293-307). The C-terminal half of the glutamate transporters is well conserved and constitutes a 
major part of the translocation pathway and contains the binding sites for the substrate and co- 
transported ions (Zhang, Y. and B.I. Kanner (1999) Proc. Natl. Acad. Sci. USA 96:1710-1715). 

Impaired re-uptake of synaptic glutamate, and a reduced expression of glutamate transporters 

15 have been found in the motor cortex of patients with amyotrophic lateral sclerosis (ALS). Inhibition of 
the synthesis of each glutamate transporter subtype using chronic antisense oligonucleotide 
administration, in vitro and in vivo, selectively and specifically reduced the protein expression and 
function of glutamate transporters. The loss of glial glutamate transporters produced elevated 
extracellular glutamate levels, neurodegeneration characteristic of excitotoxicity, and a progressive 

20 paralysis. The loss of the neuronal glutamate transporter did not elevate extracellular glutamate in the 
striatum but produced mild neurotoxicity and resulted in epilepsy (Rothstein, J.D. et al. (1996) Neuron 
16:675-686). 

The vesicular monoamine transporters (VMAT) package cytoplasmic monoamine 
neurotransmitters into secretory vesicles for regulated exocytotic release. VMAT acts as an 
25 electrogenic exchanger of protons and monoamines, using a proton electrochemical gradient. VMAT 
transporters include VMAT1 and VMAT2. The VMAT proteins possess twelve transmembrane 
segments, with both extremities lying on the cytoplasmic side. VMAT proteins are associated with 
distinct vesicle populations in neurons and neuroendocrine cells (Henry, J.-P. et al. (1994) J. Exp. Biol. 
196:251-262). 

30 Vesicular transport is inhibited by the antihypertensive drug reserpine and the related but more 

centrally acting drug tetrabenazine. The mechanism of transport and the biochemistry of VMAT have 
been analyzed with these drugs, using mainly the chromaffin granules from bovine adrenal glands as a 
source of transporters (Peter, D. et al (1994) J. Biol. Chem. 269:7231-7237). 
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Human studies indicated that reserpine can cause a syndrome resembling depression, 
indicating the importance of vesicular transport activity for the control of mood and behavior. The 
psychostimulant amphetamine also disrupts the storage of amines in secretory vesicles, further 
indicating that alterations in vesicular monoamine transport can affect behavior (Sulzer, D. and S. 

5 Rayport (1990) Neuron 5:797-808). 

Human diseases caused by defects in neurotransmitter transporters include schizophrenia, 
Tourette's syndrome, Parkinson's disease, brain ischemia, amyotrophic lateral scerlosis, depression, 
and epilepsy. For example, decreased GAB Aergic neurotransmission has been implicated in the 
pathophysiology of CNS disorders such as epilepsy and schizophrenia. Impaired re-uptake of synaptic 

10 glutamate, and a reduced expression of the glutamate transporter have been found in the motor cortex 
of patients with amyotrophic lateral sclerosis (ALS). The loss of glial glutamate transporters produces 
elevated extracellular glutamate levels, neurodegeneration characteristic of excitotoxicity, and a 
progressive paralysis. The loss of neuronal glutamate transporters produces mild neurotoxicity and 
result in epilepsy (Rothstein, J.D. et al (1996) Neuron 16:675-686). 

15 Transporters for dopamine, norepinephrine, and serotonin have particular significance as 

targets for clinically relevant psychoactive agents including cocaine, antidepressants, and 
amphetamines. Cocaine and antidepressants are transporter antagonists that act with varying degrees 
of specificity to enhance synaptic concentrations of amines by limiting clearance. Amphetamines 
enhance transporter mediated efflux in concert with a depletion of vesicular amine stores (Barker, . 

20 E.L. and R.D. Blakely (1995) Psychopharmacology 28:321-333; Sulzer, D. and S. Rayport (1990) 
Neuron 5:797-808; Wall, S.C. et al. (1995) Mol. Pharmacol. 47:544-550). 

Each of over a trillion neurons in adult humans connects with over a thousand target cells 
(Tessier-Lavigne, M. et al. (1996) Science 274:1123-1133). These neuronal connections form during 
embryonic development. Each differentiating neuron sends out an axon tipped at the leading edge by 

25 a growth cone. Aided by molecular guidance cues, the growth cone migrates through the embryonic 
environment to its synaptic target. Semaphorins are growth cone guidance signals that may function 
during embryogenesis by providing local signals to specify territories inaccessible to growing axons 
(Puschel, A.W. et al (1995) Neuron 14:941-948). 

The guidance of axons during development involves both positive and negative effects (i.e., 

30 chemoattraction and chemorepulsion). The Slit family of proteins have been implicated in promoting 
axon branching, elongation, and repulsion. Members of the Slit family have been identified in a variety 
of organisms, including insects, amphibians, birds, rodents and humans (Guthrie, S. (1999) Current 
Biology 9:R432-R435). Slit proteins appear to be ligands for the repulsive guidance receptor, 
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Roundabout (Robo); however, Slit proteins also cause elongation in some assays. A post- 
translationally processed form of Slit appears to be the active form of the protein (Guthrie, S. supra 
and Brose, K. et al. (1999) Cell 96:795-806). 

Axon growth is also guided in part by contact-mediated mechanisms involving cell surface and 

5 extracellular matrix (ECM) molecules. Many ECM molecules, including fibronectin, vitronectin, 
members of the laminin, tenascin, collagen, and thrombospondin families, and a variety of 
proteoglycans, can act either as promoters or inhibitors of neurite outgrowth and extension (Tessier- 
Lavigne et al., supra) . Receptors for ECM molecules include integrins, immunoglobulin superfamily 
members, and proteoglycans. ECM molecules and their receptors have also been implicated in the 

10 adhesion, maintenance, and differentiation of neurons (Reichardt, L.R et aL (1991) Ann. Rev. 

Neurosci. 14:531-571). The proteoglycan testican is localized to the post-synaptic area of pyramidal 
cells of the hippocampus and may play roles in receptor activity, neuromodulation, synaptic plasticity, 
and neurotransmission (Bonnet, F. et al. (1996) J. Biol. Chem. 271:4373-4380). 

Other nervous system-associated proteins have roles in neuron signaling, cell adhesion, nerve 

15 regeneration, axon guidance, and neurogenesis. The neurexophilins are neuropeptide-like proteins 
which are proteolytically processed after synthesis. They are ligands for the neuron-specific cell 
surface proteins, the a-neurexins. Neurexophilins and neurexins may participate in a neuron signaling 
pathway (Missler, M. and T.C. Sudhof (1998) J. Neurosci. 18:3630-3638; Missler, M. et al. (1998) J. 
Biol. Chem. 273 :34716-34723). Ninjurin is a neuron cell surface protein which plays a role in cell 

20 adhesion and in nerve regeneration following injury. Ninjurin is up-regulated after nerve injury in 
dorsal root ganglion neurons and in Schwann cells (*602062 Ninjurin; NINJ1 OMIM; Araki, T. and 
Milbrandt, J. (1996) Neuron 17:353-361). Mammalian Numb is a phosphotyrosine-binding (PTB) 
domain-containing protein which may be involved in cortical neurogenesis and cell fate decisions in the 
mammalian nervous system. Numb's binding partner, the LNX protein, contains four PDZ domains 

25 and a ring finger domain and may participate in a signaling pathway involving Numb. PDZ domains 
have been found in proteins which act as adaptors in the assembly of multifunctional protein 
complexes involved in signaling events at surfaces of cell membranes (Ponting, CP. (1997) Bioessays 
19:469-479). LNX contains a tyrosine phosphorylation site which may be important for the binding of 
other PTB-containing proteins such as SHC, an adaptor protein which associates with tyrosine- 

30 phosphorylated growth factor receptors and downstream effectors (Dho, S.E. et al. (1998) J. Biol. 
Chem. 273:9179-9187). 

Another family of molecules that appear to be important for neurotransmission are the 
choline-transporter-like CTL1 proteins. The prototypic CTL1 was identified in yeast as a suppressor 
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of a choline transport mutation; however, mammalian homologies have been identified. The proteins 
comprise approximately ten putative transmembrane domains in addition to transporter-like motifs but 
do not appear to be canonical choline transporters. Choline transport is important to neurotransmission 
because choline is a precursor of acetylcholine, required in abundance by cholinergic neurons 

5 (ORegan, S. et al. (2000) Proc. Natl. Acad. Sci. U.S.A. 97:1835-1840). 

Transcriptional regulatory proteins are also essential for the development of the nervous 
system and elements of neurotransmission. A specific class of transcription factors, homeobox 
transcription factors, directs nerve cell-associated tissue patterning and differentiation. The presence 
and function of these proteins appears to be ubiquitous in nematodes, arthropods, and vertebrates. 

10 One example of these proteins is DRG1 1 , a homeobox transcription factor expressed in mammalian 
sensory neurons, and which appears to be involved in neural crest development (Saito, T. et al. (1995) 
Mol. CellNeurosci. 6:280-292). 

cDNA clones were selected by comparing libraries of normal mouse cerebellar cDNA and 
cerebellar cDNA from Purkinje cell degeneration (pcd) mice. An mRNA present in Purkinje neurons 

15 encodes PCD5, a protein of 99 amino acids. PCD5 's expression is restricted to the cerebellum and 
the eye. The gene encoding PCD5 was localized to mouse chromosome 8 (Nordquist, D.T. et al. 
(1988) J. Neurosci. 8(12):4780-4789). 

Neuronal signals are transmitted across the neuromuscular junction (NMJ). Motor axons 
release the molecule agrin to induce the formation of the postsynaptic apparatus in muscle fibers. 

20 Proteins such as dystroglycan, MuSK, and rapsyn participate in the transduction of agrin signals. 
Agrin also functions in the upregulation of gene transcription in myonuclei and the control of 
presynaptic differentiation (Ruegg, M.A. and J.L. Bixby (1998) Trends Neurosci. 21:22-27). 
Neurological protein domains 

CNS-associated proteins can be phosphoproteins. For example, ARPP-21 (cyclic 

25 AMP-regulated phosphoprotein) is a cytosolic neuronal phosphoprotein that is highly enriched in the 
striatum and in other dopaminoceptive regions of the brain. The steady-state level of ARPP-21 
mRNA is developmental^ regulated. But, in the neonatal and mature animal, ARPP-21 mRNA is not 
altered following 6-hydroxydopamine lesions of the substantia nigra or by pharmacologic treatments 
that upregulate the Dl- or D2-dopamine receptors (Ehrlich, M.E. et al (1991) Neurochem. 57:1985- 

30 1991). 

CNS-associated signaling proteins may contain PDZ domains. PDZ domains have been 
found in proteins which act as adaptors in the assembly of multifunctional protein complexes involved 
in signaling events at surfaces of cell membranes. PDZ domains are generally found in membrane- 
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associated proteins including neuronal nitric oxide synthase (NOS) and several dystrophin-associated 
proteins (Ponting, CP. et al. (1997) Bioessays 19:469-479). PSD-95/SAP90 is a membrane- 
associated guanylate kinase found in neuronal cells at the postsynaptic density (PSD) (Takeuchi, M. et 
al. (1997) J. Biol. Chem. 272:11943-11951). PSD-95/SAP90 contains three PDZ domains, one SIB 

5 domain, and one guanylate kinase domain. The PDZ domains mediate interactions with NMDA 
receptors, Shaker-type potassium channels, and brain nitric oxide synthase. S APAPs (SAP90/PSD- 
95-Associated Proteins) promote localization of PSD-95/SAP90 at the plasma membrane. 

CNS-associated proteins may also contain epidermal growth factor (EGF) domains. The 
Notch proteins are transmembrane proteins which contain extracellular regions of repeated EGF 

10 domains. Notch proteins, such as the Drosophila melanogaster neurogenic protein Notch, are 

generally involved in the inhibition of developmental processes. Other members of the Notch family 
are the lin-12 and glp-1 genes of Caenorhabditis elegans. Genetic studies indicate that the lin-12 

r 

and glp-1 proteins act as receptors in specific developmental cell interactions which may be involved in 
certain embyronic defects (Tax, F. E. et al. (1994) Nature 368:150-154). Pecanex, a maternal-effect 
15 neurogenic locus of D. melanogaster, is believed to encode a large transmembrane protein. In the 
absence of maternal expression of the pecanex gene, an embryo develops severe hyperneuralization 
similar to that characteristic of Notch mutant embryos (LaBonne, S. G. et al. (1989) Dev. Biol. 
136:1-116). 

Other CNS-associated signaling proteins contain WW domains. The WW domain is a protein 
20 motif with two highly conserved tryptophans. It is present in a number of signaling and regulatory 
proteins, including Huntingtin interacting protein. Several fibroblast growth factor (FGF) homologous 
factors (i.e., FHF polypeptides) have also been implicated in nervous system development based on 
mRNA expression patterns in mouse and human tissues. Members of the FHF family of polypeptides 
are structurally distinct from prototypic FGFs, consistent with the unusual role of these FGF-related 
25 proteins (Smallwood, P.M. et al (1996) Proc. Natl. Acad. Sci. U.S.A. 93:9850-9857 and Hartung, H. 
et aL (1997) Mech. Dev. 64:31-39). 

CNS-associated proteins may also contain leucine rich repeats (LRR) which are short motifs 
found in numerous proteins from a wide range of species. LRR motifs are of variable length, most 
commonly 20-29 amino acids, and multiple repeats are typically present in tandem LRR motifs are 
30 important for protein/protein interactions, and LRR proteins are involved in cell/cell interactions, 

morphogenesis, and development (Kobe, B. and Deisenhofer, J. (1995) Curr. Opin. Struct. Biol. 5:409- 
416). The human ISLR (immunoglobulin superfamily containing leucine-rich repeat) protein contains a 
C2-type immunoglobulin domain as well as LRR motifs. The ISLR gene is linked to the critical region 
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for Bardet-Biedl syndrome, a developmental disorder of which the most common feature is retinal 
dystrophy (Nagasawa, A. et al. (1999) Genomics 61:37-43). Brain leucine rich repeat protein, a 
member of the LRR family of proteins, possibly functions in nervous system development and 
maintenance (Taniguchi, H. et al. (1996) Brain Res. Mol. Brain Res. 36:45-52). 

5 Disorders associated with neurological processes 

Alzheimer's disease (AD) is a degenerative disorder of the CNS which causes progressive 
memory loss and cognitive decline during mid to late adult life. AD is characterized by a wide range 
of neuropathologic features including amyloid deposits and intra-neuronal neurofibrillary tangles. 
Although the pathogenic pathway leading to neurodegeneration and AD is not well understood, at least 

10 three genetic loci that confer genetic susceptibility to the disease have been identified (Schellenberg, 
G.D. (1995) Proc. Natl. Acad. Sci. 92:8552-8559; Sherrington, R. et al. (1995) Nature 375:754-760). 

Familial British dementia (FBD), is an autosomal dominant disease featuring amyloid plaques 
surrounded by astrocytes and microglia, neurofibrillary tangles, neuronal loss, and progressive 
dementia. The BRI gene on chromosome 13 encodes a 4 kD peptide, A-Bri. This membrane- 

15 anchored protein is a primary constituent of amyloid deposits, and its presence in lesions from the CNS 
of FBD patients maybe a contributive factor of this disease (El-Agnaf, O.M.A. et al. (2001) 
Biochemistry 40:3449-3457). 

Astrocytomas, and the more malignant glioblastomas, are the most common primary tumors of 
the brain, accounting for over 65% of primary brain tumors. These tumors arise in glial cells of the 

20 astrocyte lineage. Following infection by pathogens, astrocytes function as antigen-presenting cells 
and modulate the activity of lymphocytes and macrophages. Astrocytomas constitutively express 
many cytokines and interleukins that are normally produced only after infection by a pathogen (de 
Micco, C. (1989) J. Neuroimmunol. 25:93-108). In the course of identifying genes related to astrocyte 
differentiation, one cDNA was isolated from an astrocytoma cDNA library that encodes a protein 

25 structurally related to the plant pathogenesis-related (PR) proteins (Murphy, E.V. et al. (1995) Gene 
159:131-135). The glioma pathogenesis-related protein (GliPR) is highly expressed in glioblastoma, 
but not in fetal or adult brain, or in other nervous system tumors. PR proteins are a family of small 
(10-20 kDa), protease resistant proteins induced in plants by viral infections, such as tobacco mosaic 
virus. The synthesis of PR proteins is believed to be part of a primitive immunological response in 

30 plants (van Loon, L.C. (1985) Plant Mol. Biol. 4:111-116). GliPR shares up to 50% homology with the 
PR-1 protein family over a region that comprises almost two thirds of the protein, including a 
conserved triad of amino acids, His-Glu-His, appropriately spaced to form a metal-binding domain 
(Murphy et al., supra). 
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Signaling initiated by the Trk family receptors plays a dynamic role in neurogenic tumors. The 
proto-oncogene Tries encode the high-affinity receptor tyrosine kinases for nerve growth factor (NGF) 
neurotrophic. A rearranged Trk oncogene is often observed in non-neuronal neoplasms such as 
colon and papillary thyroid cancers. The proto-oncogene Trks regulates growth, differentiation and 

5 apoptosis of tumors of neuronal origin, such as neuroblastoma and medulloblastoma (Nakagawara, A. 
(2001) Cancer Lett.l69:107-114). 

Neuronal thread proteins (NTP) are a group of immunologically related molecules found in the 
brain and neuroectodermal tumor cell lines. NTP expression is increased in neuronal cells during 
proliferation, differentiation, brain development, in Alzheimer's disease (AD) brains, and in 

10 pathological states associated with regenerative nerve sprouting (de la Monte, S.M. et al. (1996) J. 
Neuropathol. Exp. Neurol. 55:1038-1050). Monoclonal antibodies generated to a recombinant NTP, 
AD7c-NTP, isolated from an end-stage AD brain library, showed high levels of NTP 
immunoreactivity in perikarya, neuropil fibers, and white matter fibers of AD brain tissue. In vitro 
studies also demonstrated NTP upregulation, phosphorylation, and translocation from the perikarya to 

15 cell processes and growth cones during growth factor-induced neuritic sprouting and neuronal 

differentiation. Additionally, increased NTP immunoreactivity was found in Down syndrome brains 
beginning in the second decade, prior to establishment of widespread AD neurodegeneration, and at an 
age when a low-level or an absence of NTP expression was observed in control brains. These 
findings indicated that abnormal expression and accumulation of NTP in brain may be an early marker 

20 of AD neurodegeneration in Down syndrome (de la Monte, S.M. et al. (1996) J. Neurol. ScL 135:1 18- 
125). Furthermore, the increased expression and accumulation of NTP in AD brain tissue was 
paralleled by corresponding elevations of NTP in cerebrospinal fluid (CSF), and elevated levels of 
NTP were detectable in the CSF early in the course of the disease. 

Ubiquilin is a presenilin-interacting protein. Ubiquilin contains numerous ubiquitin-like 

25 domains thought to be involved in targeting proteins for degradation. However, ubiquilin promotes 
increased presenilin protein accumulation. Ubiquilin, as a modulator of presenilin levels may have 
implications involving various cellular functions. Presenilins are linked to a variety of biological 
processes, including calcium regulation, Notch signaling, apoptosis, regulation of the cell cycle, 
including the unfolded-protein response and 0-amyloid precursor protein-associated gamma secretase 

30 activity. Expression of ubiquilin is highest in human brain neurons and is associated with neurofibrillary 
tangles and Lewy bodies of Alzheimer's disease and Parkinson's disease brains, respectively 
(MahAL., et aL (2000) J. CellBiol. 151:847-862). 

Fe65-like protein (Fe65L2), a new member of the Fe65 protein family, is one of the ligands 
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that interacts with the cytoplasmic domain of Alzheimer beta-amyloid precursor protein (APP). 
Transgenic mice expressing APP simulate some of the prominent behavioral and pathological features 
of Alzheimer's disease, including age-related impairment in learning and memory, neuronal loss, gliosis, 
neuritic changes, amyloid deposition, and abnormal tau phosphorylation (Duilio, A. et al. (1998) 

5 Biochem. J. 330:513-519). 

Amyotrophic lateral sclerosis (ALS) is characterized by motor neuron death, altered 
peroxidase activity of mutant SOD1, changes in intracellular copper homeostasis, protein aggregation, 
and changes in the function of ghitamate transporters leading to excitotoxicity. Neurofilaments and 
peripherin appear to play some part in motor neuron degeneration. ALS is occasionally associated 

10 with mutations of the neurofilament heavy chain gene (Al-Chalabi, A. and P.N. Leigh (2000) Curr. 
Opin. Neurol. 13:397-405). Cytoskeletal abnormalities such as abnormal inclusions containing 
neurofilaments (NFs) and/or peripherin, reduced mRNA levels for the NF light (NF-L) protein and 
mutations in the NF heavy (NF-H) gene have been observed in ALS. Intermediate filament inclusions 
containing peripherin may play a contributory role in ALS (Julien, J.P. and J.M. Beaulieu (2000) J. 

15 Neurol. Sci. 180:7-14). 

Miller-Dieker syndrome (MDS) or isolated lissencephaly syndrome (ILS) are characterized by 
a smooth cerebral surface, a thickened cortex with four abnormal layers, and misplaced neurons. 
Both conditions may result from deletion or mutation in the LIS1 gene. The lissencephaly gene 
product Lisl is a component of evolutionarily conserved intracellular multiprotein complexes essential 

20 for neuronal migration, and which may be components of the machinery for cell proliferation and 
intracellular transport (Leventer, R.J. et al. (2001) Trends Neurosci. 24:489-492). NudC, a nuclear 
movement protein, interacts with Lisl (Morris, S.M. et al. (1998) Curr. Biol. 8:603-606). 

CNS-associated proteins can also be phosphoproteins. For example, ARPP-21 (cyclic 
AMP-regulated phosphoprotein) is a cytosolic neuronal phosphoprotein that is highly enriched in the 

25 striatum and in other dopaminoceptive regions of the brain. The steady-state level of ARPP-21 

mRNA is developmental^ regulated. But, in the neonatal and mature animal, ARPP-21 mRNA is not 
altered following 6-hydroxydopamine lesions of the substantia nigra or by pharmacologic treatments 
that upregulate the Dl- or D2-dopamine receptors. (Ehrlich, M. E. et al. (1991) Neurochem. 57:1985- 
1991.) 

30 Retinitis pigmentosa comprises a group of slowly progressive, inherited disorders of the retina 

that cause loss of night vision and peripheral visual field in adolescence. A recessive nonsense 
mutation in the Drosophila opsin gene causes photoreceptor degeneration. In some families, genes 
encoding rhodopsin and peripherin/RDS map very close to the disease loci. Rhodopsin and 
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peripherin/RDS mutations have been found in approximately 30% of all autosomal dominant cases 

(Shastry, B.S. (1994) Am. J. Med. Genet. 52:467-474). 

Astrocytomas, and the more malignant glioblastomas are primary tumors of the brain. The 

glioma pathogenesis-related protein (GliPR) is highly expressed in glioblastoma, but not in fetal or adult 
5 brain, or in other nervous system tumors (Murphy, E.V. et al. (1995) Gene 159:131-135). Signaling 

initiated by the Trk family receptors plays a dynamic role in neurogenic tumors. The proto-oncogene 

Trks encode the high-affinity receptor tyrosine kinases for nerve growth factor (NGF) neurotrophic. 

A rearranged Trk oncogene is often observed in non-neuronal neoplasms such as colon and papillary 

thyroid cancers. The proto-oncogene Trks regulates growth, differentiation and apoptosis of tumors of 
10 neuronal origin, such as neuroblastoma and medulloblastoma (Nakagawara, A. (2001) Cancer 

Lett.l69:107-114). 

Synaptic proteins are involved in Alzheimer's disease (AD) and other disorders including 
ischemia, a variety of disorders where synapse-associated proteins are abnormally accumulated in the 
nerve terminals or synaptic proteins are altered after denervation, and neoplastic disorders (Masliah, 

15 E. and R. Terry (1993) Brain Pathol. 3 :77-85). Synaptophysin (S Y), a major integral membrane 
protein of small synaptic vesicles, is on the X chromosome in subbands Xpll.22-pll.23, a region 
implicated in several inherited diseases including Wiskott-Aldrich syndrome, three forms of X-linked 
hypercalciuric nephrolithiaisis, and the eye disorders retinitis pigmentosa 2, congenital stationary night 
blindness, and Aland Island eye disease (Fisher, S.E. et al. (1997) Genomics 45:340-347). 

20 Mutations in the BRI2 isoform of the BRI gene family are associated with dementia in 

humans (Vidal, R. et al. (2001) Gene 266:95-102). 

Changes in the molecular and cellular components of neuronal signaling systems correlate 
with the effects on mood and cognition observed after long-term treatment with antidepressant drugs. 
Two serine/threonine kinases, Ca 2+ /calmodulin-dependent protein kinase II and cyclic AMP-dependent 

25 protein kinase, are activated in the brain following antidepressant treatment. Associated changes in 
the phosphorylation of selected protein substrates in subcellular compartments including presynaptic 
terminals and microtubules may contribute to the modulation of synaptic transmission observed with 
antidepressants (Popoli, M. et al. (2001) Pharmacol. Ther. 89:149-170). Reserpine can cause a 
syndrome resembling depression, indicating the importance of vesicular transport activity for the 

30 control of mood and behavior. The psychostimulant amphetamine also disrupts the storage of amines 
in secretory vesicles, further indicating that alterations in vesicular monoamine transport can affect 
behavior (Sulzer, D. and S. Rayport (1990) Neuron 5:797-808). 

Decreased GAB AergLc neurotransmission has been implicated in the pathophysiology of CNS 
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disorders such as epilepsy and schizophrenia. Impaired re-uptake of synaptic glutamate and a reduced 
expression of the glutamate transporter have been found in the motor cortex of patients with 
amyotrophic lateral sclerosis (ALS). The loss of glial glutamate transporters produces elevated 
extracellular glutamate levels, neurodegeneration characteristic of excitotoxicity, and a progressive 

5 paralysis. The loss of neuronal glutamate transporters produces mild neurotoxicity and results in 
epilepsy (Rothstein, J.D. et al. (1996) Neuron 16:675-686). GABA transporter function is reduced in 
epileptic hippocampi. Transporters for dopamine, norepinephrine, and serotonin have particular 
significance as targets for clinically relevant psychoactive agents including cocaine, antidepressants, 
and amphetamines. Cocaine and antidepressants are transporter antagonists that act with varying 

10 degrees of specificity to enhance synaptic concentrations of amines by limiting clearance. 

Amphetamines enhance transporter mediated efflux in concert with a depletion of vesicular amine 
stores (Barker, E.L. and R.D. Blakely (1995) Psychopharmacology 28:321-333; Sulzer, D. and S. 
Rayport (1990) Neuron 5:797-808; Wall, S.C. et al. (1995) Mol. Pharmacol. 47:544-550). 

The jx-opioid receptor (MOR) mediates the actions of analgesic agents including morphine, 

15 codeine, methadone, and fentanyl as well as heroin. MOR is functionally coupled to a G-protein- 
activated potassium channel (Mestek A. et al. (1995) J. Neurosci. 15:2396-2406). A variety of MOR 
subtypes exist. Alternative splicing has been observed with MOR-1 as with a number of G 
protein-coupled receptors including somatostatin 2, dopamine D2, prostaglandin EP3, and serotonin 
receptor subtypes 5-hydroxytryptamine4 and 5-hydroxytxyptamine7 (Pan, Y.X. et al. (1999) Mol 

20 Pharm 56:396-403). 

The central nervous system regulates the innate immune system by elaborating 
anti-inflammatory hormone cascades in response to bacterial products and immune mediators. The 
central nervous system also responds via acetylcholine-mediated efferent signals carried through the 
vagus nerve. Nicotinic cholinergic receptors expressed on macrophages detect these signals and 

25 respond with a dampened cytokine response (Tracey KJ. et al. (2001) FASEB J.15:1575-1576). 

Machado- Joseph disease (MID) is an autosomal dominant, neurodegenerative disorder 
characterized by cerebellar ataxia, pyramidal and extra-pyramidal signs, peripheral nerve palsy, 
external ophthalmoplegia, facial and lingual f asciculation and bulging. The MID josephin protein 
(amino acid residues 29-62) is a predominantly cytoplasmic protein associated with human neurons, 

30 but also detected in the nuclei of neurons and glial cells (Mruk, D.D. and Cheng, C. Y. (1999) J. Biol. 
Chem. 274:27056-27068). 

Juvenile neuronal ceroid lipofuscinosis (JNCL), also known as Batten disease, is an autosomal 
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recessive lysosomal storage disease associated with mutations in CLN3. The predominant mutation in 
CLN3 is a 1.02 kb genomic deletion that accounts for nearly 85% of the disease alleles. Additional 
missense and nonsense mutations have been described. Some missense substitutions result in a 
protracted phenotype, with delays in the onset of classical clinical features, whereas others lead to 

5 classical JNCL. CLN3 is a hydrophobic protein containing 5 to 7 transmembrane domains. CLN3 is 
found to be highly associated with lysosome-associated membrane protein H in non-neuronal cells and 
with synaptophysin in neuronal cell lines (Haskell, R.E. et al. (2000) Hum. Mol. Genet. 9:735-744). 

Dysferlin is the protein product of the gene mutated in patients with an autosomal recessive 
limb-girdle muscular dystrophy type 2B (LGMD2B) and a distal muscular dystrophy, Miyoshi 

10 myopathy. Dysferlin is homologous to a Caenorhabditis elegans spermatogenesis factor, FER-1. 
Otoferlin, another human FER-l-like protein (ferlin), is responsible for autosomal recessive 
nonsyndromic deafness (DFNB9). All the ferlins are characterized by sequences corresponding to 
multiple C2 domains that share the highest level of homology with the C2A domain of rat 
synaptotagmin IH (Britton S. et al. (2000) Genomics 68:3 13-321). 

15 Expression profiling 

Microarrays are analytical tools used in bioanalysis. A microarray has a plurality of molecules 
spatially distributed over, and stably associated with, the surface of a solid support. Microarrays of 
polypeptides, polynucleotides, and/or antibodies have been developed and find use in a variety of 
applications, such as gene sequencing, monitoring gene expression, gene mapping, bacterial 

20 identification, drug discovery, and combinatorial chemistry. 

One area in particular in which microarrays find use is in gene expression analysis. Array 
technology can provide a simple way to explore the expression of a single polymorphic gene or the 
expression profile of a large number of related or unrelated genes. When the expression of a single 
gene is examined, arrays are employed to detect the expression of a specific gene or its variants. 

25 When an expression profile is examined, arrays provide a platform for identifying genes that are tissue 
specific, are affected by a substance being tested in a toxicology assay, are part of a signaling 
cascade, carry out housekeeping functions, or are specifically related to a particular genetic 
predisposition, condition, disease, or disorder. 
Lung Cancer 

30 Lung cancer is the leading cause of cancer death for men and the second leading cause of 

cancer death for women in the U.S. The vast majority of lung cancer cases are attributed to smoking 
tobacco, and increased use of tobacco products in third world countries is projected to lead to an 
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epidemic of lung cancer in these countries. Exposure of the bronchial epithelium to tobacco smoke 
appears to result in changes in tissue morphology, which are thought to be precursors of cancer. Lung 
cancers are divided into four histopathologically distinct groups. Three groups (squamous cell 
carcinoma, adenocarcinoma, and large cell carcinoma) are classified as non-small cell lung cancers 
5 (NSCLCs). The fourth group of cancers is referred to as small cell lung cancer (SCLC). Collectively, 
NSCLCs account for -70% of cases while SCLCs account for -18% of cases. The molecular and 
cellular biology underlying the development and progression of lung cancer are incompletely 
understood. 

Deletions on chromosome 3 are common in this disease and are thought to indicate the 
10 presence of a tumor suppressor gene in this region. Activating mutations in K-ras are commonly 
found in lung cancer and are the basis of one of the mouse models for the disease. 
Ovarian Cancer 

Ovarian cancer is the leading cause of death from a gynecologic cancer. The majority of 

ovarian cancers are derived from epithelial cells, and 70% of patients with epithelial ovarian cancers 
15 present with late-stage disease. As a result, the long-term survival rates for this disease is very low. 

Identification of early-stage markers for ovarian cancer would significantly increase the survival rate. 

The molecular events that lead to ovarian cancer are poorly understood. . Some of the known 

aberrations include mutation of p53 and microsatellite instability. Since gene expression patterns are . 

likely to vary when normal ovary is compared to ovarian tumors, examination of gene expression in 
20 these tissues to identify possible markers for ovarian cancer is particularly relevant to improving 

diagnosis, prognosis, and treatment of this disease. 

Colon Cancer 

Colorectal cancer is the second leading cause of cancer deaths in the United States. Colon 
cancer is associated with aging, since 90% of the total cases occur in individuals over the age of 55. 

25 A widely accepted hypothesis is that several contributing genetic mutations must accumulate over time 
in an individual who develops the disease. To understand the nature of genetic alterations in colorectal 
cancer, a number of studies have focused on the inherited syndromes. The first known inherited 
syndrome, Familial Adenomatous Polyposis (FAP), is caused by mutations in the Adenomatous 
Polyposis Coli gene (APC), resulting in truncated or inactive forms of the protein. This tumor 

30 suppressor gene has been mapped to chromosome 5q. The second known inherited syndrome is 

hereditary nonpolyposis colorectal cancer (HNPCC), which is caused by mutations in mismatch repair 
genes. 
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Although hereditary colon cancer syndromes occur in a small percentage of the population 
and most colorectal cancers are considered sporadic, knowledge from studies of the hereditary 
syndromes can be generally applied. For instance, somatic mutations in APC occur in at least 80% of 
indiscriminate colon tumors. APC mutations are thought to be the initiating event in the disease. 
5 Other mutations occur subsequently. Approximately 50% of colorectal cancers contain activating 
mutations in ras, while 85% contain inactivating mutations in p53. Changes in these genes lead to 
gene expression changes in colon cancer. Less is understood about downstream targets of these 
mutations and the role they may play in cancer development and progression. 
Tangier Disease 

10 Tangier disease (TO) is a rare genetic disorder characterized by near absence of circulating 

high density lipoprotein (HDL) and the accumulation of cholesterol esters in many tissues, including 
tonsils, lymph nodes, liver, spleen, thymus, and intestine. Low levels of HDL represent a clear 
predictor of premature coronary artery disease and homozygous TD correlates with a four- to six-fold 
increase in cardiovascular disease compared to controls. The major cardioprotective activity of HDL 

15 is ascribed to its role in reverse cholesterol transport, the flux of cholesterol from peripheral cells such 
as tissue macrophages, through plasma lipoproteins to the liver. The HDL protein, apolipoprotein A-I, 
plays a major role in this process, interacting with the cell surface to remove excess cholesterol and 
phospholipids. Recent studies have shown that this pathway is severely impaired in TD and the defect 
lies in a specific gene, the ABC1 transporter. This gene is a member of the family of ATP-binding 

20 cassette transporters, which utilize ATP hydrolysis to transport a variety of substrates across 
membranes. > 
RNA Expression 

Atherosclerosis and the associated coronary artery disease and cerebral stroke represent the 
most common cause of death in industrialized nations. Although certain key risk factors have been 
25 identified, a full molecular characterization that elucidates the causes and provide care for this 
complex disease has not been achieved. Molecular characterization of growth and regression of 
atherosclerotic vascular lesions requires identification of the genes that contribute to features of the 
lesion including growth, stability, dissolution, rupture and, most lethally, induction of occlusive vessel 
thrombus. 

30 An early step in the development of atherosclerosis is formation of the "fatty streak". 

Lipoproteins, such as the cholesterol-rich low-density lipoprotein (LDL), accumulate in the 
extracellular space of the vascular intima, and undergo modification. Oxidation of LDL occurs most 
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avidly in the sub-endothelial space where circulating antioxidant defenses are less effective. The 
degree of LDL oxidation affects its interaction with target cells. 'Minimally oxidized" LDL (MM- 
LDL) is able to bind to LDL receptor but not to the oxidized LDL (Ox-LDL) or "scavenger" 
receptors that have been identified, including scavenger receptor types A and B, CD36 , 

5 CD68/macrosialin and LOX-1 (Navab et al. (1994) Arterioscler Thromb Vase Biol 16:83 1-842; 
Kodama et al. (1990) Nature 343:531-535; Acton et al. (1994) J Biol Chem 269:21003-21009; 
Endemann et al. (1993) J Biol Ohem 268:1 1811-11816; Ramprasad et al. (1996) Proc Natl Acad Sci 
92:14833-14838; Kataoka et al (1999) Circulation 99:3110-3117). MM-LDL can increase the 
adherence and penetration of monocytes, stimulate the release of monocyte chemotactic protein 1 

10 (MCP-1) by endothelial cells, and induce scavenger receptor A (SRA) and CD36 expression in 
macrophages (Cushing et al. (1990) Proc Natl Acad Sci 87:5134-5138; Yoshida et al. (1998) 
Arterioscler Thromb Vase Biol 18:794-802; Steinberg (1997) J Biol Chem 272:20963-20966). SRA 
and the other scavenger receptors can bind Ox-LDL and enhance uptake of lipoprotein particles. 

Mononuclear phagocytes enter the intima, differentiate into macrophages, and ingest modified 

15 lipids including Ox-LDL. In most cell types, cholesterol content is tightly controlled by feedback 
regulation of LDL receptors and biosynthetic enzymes (Brown and Goldstein (1986) Science 232:34- 
47). In macrophages, however, the additional scavenger receptors lead to unregulated uptake.of 
cholesterol (Brown and Goldstein (1983) Annu Rev Biochem 52:223-261) and accumulation of 
multiple intracellular lipid droplets producing a "foam cell" phenotype. Cholesterol-engorged and dead 

20 macrophages contribute most of the mass of early "fatty streak" plaques and typical "advanced" 

lesions of diseased arteries. Numerous studies have described a variety of foam cell responses that 

contribute to growth and rupture of atherosclerotic vessel wall plaques. These responses include 

production of multiple growth factors and cytokines, which promote proliferation and adherence of 

neighboring cells; chemokines, which further attract circulating monocytes into the growing plaque; 

25 proteins, which cause remodeling of the extracellular matrix; and tissue factor, which can trigger 

i 

thrombosis (Ross (1993) Nature 362:801-809; Quin et al. (1987) Proc Natl Acad Sci 84:2995-2998). 
Thus, cholesterol-loaded macrophages which occur in abundance in most stages of the atherosclerotic 
plaque formation contribute to inception of the atheroscerotic process and to eventual plaque rupture 
and occlusive thrombus. 

30 During Ox-LDL uptake, macrophages produce cytokines and growth factors that elicit further 

cellular events that modulate atherogenesis such as smooth muscle cell proliferation and production of 
extracellular matrix. Additionally, these macrophages may activate genes involved in inflammation 
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including inducible nitric oxide synthase. Thus, genes differentially expressed during foam cell 
formation may reasonably be expected to be markers of the atherosclerotic process. 
Association of NTRAN polynucleotides with Parkinson's Disease 

Several genes have been identified as showing linkage to autosomal dominant forms of 

5 Parkinson's Disease (PD). PD is a common neurodegenerative disorder causing bradykinesia, resting 
tremor, muscular rigidity, and postural instability. Cytoplasmic eosinophilic inclusions called Lewy 
bodies, and neuronal loss especially in the substantia nigra pars compacta, are pathological hallmarks 
of PD (Valente, E.M. et al (2001) Am. J. Hum. Genet. 68:895-900). Lewy body Parkinson disease 
has been thought to be a specific autosomal dominant disorder (Wakabayashi, K. et al. (1998) Acta 

10 Neuropath. 96:207-210). Juvenile parkinsonism may be a specific autosomal recessive disorder 

(Matsumine, H. et al. (1997) Am. J. Hum. Genet. 60: 588-596, 1997). (Online Mendelian Inheritance 
in Man, OMM. Johns Hopkins University, Baltimore, MD. MDM Number: 168600: Sept. 9, 2002: . 
World Wide Web URL: http://www.ncbi.nlm.nih.gov/omim/) 

The discovery of new neurotransmission-associated proteins, and the polynucleotides encoding 

15 them, satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of autoimmune/inflammatory disorders, cardiovascular disorders, 
neurological diseases, developmental disorders, and cell proliferative diseases and disorders, including 
cancer, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of neurotransmission-associated proteins. 

20 There is a need in the art for new compositions, including nucleic acids and proteins, for the 

diagnosis, prevention, and treatment of autoimmune/inflammatory, cardiovascular, neurological, 
developmental, cell proliferative, transport, psychiatric, metabolic, and endocrine disorders. 

SUMMARY OF THE INVENTION 

25 Various embodiments of the invention provide purified polypeptides, 

neurotransmission-associated proteins, referred to collectively as 'NTRAN' and individually as 
'NTRAN-1/ 'NTRAN-2,' 'NTRAN-3,' 'NTRAN-4,' 'NTRAN-5,' 'NTRAN-6/ 'NTRAN-7,' 
'NTRAN-8/ 'NTRAN-9,' 'NTRAN-10/ 'NTRAN-11,' 'NTRAN-12,' 'NTRAN-13/ 'NTRAN-14/ 
'NTRAN-15,' 'NTRAN-16,' 'NTRAN-17,' 'NTRAN-18,' 'NTRAN-19,' 'NTRAN-20,' 'NTRAN- 

30 21/ and 'NTRAN-22' and methods for using these proteins and their encoding polynucleotides for the 
detection, diagnosis, and treatment of diseases and medical conditions. Embodiments also provide 
methods for utilizing the purified neurotransmission-associated proteins and/or their encoding 
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polynucleotides for facilitating the drug discovery process, including determination of efficacy, dosage, 
toxicity, and pharmacology. Related embodiments provide methods for utilizing the purified 
neurotransmission-associated proteins and/or their encoding polynucleotides for investigating the 
pathogenesis of diseases and medical conditions. 
5 An embodiment provides an isolated polypeptide selected from the group consisting of a) a 

polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
22, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical or at 
least about 90% identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-22, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
10 from the group consisting of SEQ ID NO:l-22, and d) an immunogenic fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22. Another 
embodiment provides an isolated polypeptide comprising an amino acid sequence of SEQ ID NO: 1-22. 

Still another embodiment provides an isolated polynucleotide encoding a polypeptide selected 
from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the 
15 group consisting of SEQ ID NO:l-22, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical or at least about 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-22, c) a biologically active fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
20 consisting of SEQ ID NO:l-22. In another embodiment, the polynucleotide encodes a polypeptide 
selected from the group consisting of SEQ ID NO:l-22. In an alternative embodiment, the 
polynucleotide is selected from the group consisting of SEQ ID NO:23-44. 

Still another embodiment provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
25 consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-22, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-22, c) a biologically active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-22, and d) an immunogenic fragment of 
30 a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-22. 
Another embodiment provides a cell transformed with the recombinant polynucleotide. Yet another 
embodiment provides a transgenic organism comprising the recombinant polynucleotide. 
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Another embodiment provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO:l-22, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an amino acid sequence selected from the group 

5 consisting of SEQ ID NO: 1-22, c) a biologically active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of 
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-22> 
The method comprises a) culturing a cell under conditions suitable for expression of the polypeptide, 
wherein said cell is transformed with a recombinant polynucleotide comprising a promoter sequence 

10 operably linked to a polynucleotide encoding the polypeptide, and b) recovering the polypeptide so 
expressed. 

Yet another embodiment provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, b) a polypeptide comprising a naturally 

15 occurring amino acid sequence at least 90% identical or at least about 90% identical to an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-22, c) a biologically active fragment of 
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-22, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ED NO:l-22. 

20 Still yet another embodiment provides an isolated polynucleotide selected from the group 

consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:23-44, b) a polynucleotide comprising a naturally occurring polynucleotide 
sequence at least 90% identical or at least about 90% identical to a polynucleotide sequence selected 
from the group consisting of SEQ ID NO:23-44, c) a polynucleotide complementary to the 

25 polynucleotide of a), d) a polynucleotide complementary to the polynucleotide of b), and e) an RNA 
equivalent of a)-d). In other embodiments, the polynucleotide can comprise at least about 20, 30, 40, 
60, 80, or 100 contiguous nucleotides. 

Yet another embodiment provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide being selected from the group consisting of a) a polynucleotide comprising a 

30 polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, b) a polynucleotide 
comprising a naturally occurring polynucleotide sequence at least 90% identical or at least about 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, c) a 
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polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the 
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 

5 said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex. In a related embodiment, the method can include detecting the amount of 
the hybridization complex. In still other embodiments, the probe can comprise at least about 20, 30, 
40, 60, 80, or 100 contiguous nucleotides. 

10 Still yet another embodiment provides a method for detecting a target polynucleotide in a 

sample, said target polynucleotide being selected from the group consisting of a) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, b) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 

15 NO:23-44, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and b) detecting the presence or absence of said amplified target polynucleotide 
or fragment thereof. In a related embodiment, the method can include detecting the amount of the 

20 amplified target polynucleotide or fragment thereof. 

Another embodiment provides a composition comprising an effective amount of a polypeptide 
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-22, b) a polypeptide comprising a naturally occurring 
amino acid sequence at least 90% identical or at least about 90% identical to an amino acid sequence 

25 selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active fragment of a 

polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-22, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-22, and a pharmaceutically acceptable excipient. In one 
embodiment, the composition can comprise an amino acid sequence selected from the group consisting 

30 of SEQ ID NO: 1-22. Other embodiments provide a method of treating a disease or condition 

associated with decreased or abnormal expression of functional NTRAN, comprising administering to 
a patient in need of such treatment the composition. 
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Yet another embodiment provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-22, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 

5 amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1-22, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22. The method comprises a) exposing a sample 
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. Another 

10 embodiment provides a composition comprising an agonist compound identified by the method and a 
pharmaceutically acceptable excipient Yet another embodiment provides a method of treating a 
disease or condition associated with decreased expression of functional NTRAN, comprising 
administering to a patient in need of such treatment the composition. 

Still yet another embodiment provides a method for screening a compound for effectiveness 

15 as an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-22, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical or at least about 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO:l-22, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 

20 consisting of SEQ ID NO: 1-22, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-22. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in 
the sample. Another embodiment provides a composition comprising an antagonist compound 
identified by the method and a pharmaceutically acceptable excipient. Yet another embodiment 

25 provides a method of treating a disease or condition associated with overexpression of functional 
NTRAN, comprising administering to a patient in need of such treatment the composition. 

Another embodiment provides a method of screening for a compound that specifically binds to 
a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-22, b) a polypeptide comprising a 

30 naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-22, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
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ID NO:l-22, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-22. The method comprises a) combining the 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 
polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
5 polypeptide. 

Yet another embodiment provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-22, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 

10 amino acid sequence selected from the group consisting of SEQ ID NO: 1-22, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO:l-22, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-22. The method comprises a) combining the 
polypeptide with at least one test compound under conditions permissive for the activity of the 

15 polypeptide, b) assessing the activity of the polypeptide in the presence of the test compound, and c) 
comparing the activity of the polypeptide in the presence of the test compound with the activity of the 
polypeptide in the absence of the test compound, wherein a change in the activity of the polypeptide in 
the presence of the test compound is indicative of a compound that modulates the activity of the 
polypeptide. 

20 Still yet another embodiment provides a method for screening a compound for effectiveness in 

altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, the method 
comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing the expression of the target 

25 polynucleotide in the presence of varying amounts of the compound and in the absence of the 
compound. 

Another embodiment provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
30 contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:23-44, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
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least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:23-44, iii) a polynucleotide having a sequence complementary to i), iv) a polynucleotide 
complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs 
under conditions whereby a specific hybridization complex is formed between said probe and a target 

5 polynucleotide in the biological sample, said target polynucleotide selected from the group consisting of 
i) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:23-44, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical or at least about 90% identical to a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:23-44, iii) a polynucleotide complementary to the polynucleotide of i), iv) a 

10 polynucleotide complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). 

Alternatively, the target polynucleotide can comprise a fragment of a polynucleotide selected from the 
group consisting of i)-v) above; c) quantifying the amount of hybridization complex; and d) comparing 
the amount of hybridization complex in the treated biological sample with the amount of hybridization 
complex in an untreated biological sample, wherein a difference in the amount of hybridization 

15 complex in the treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for foil length polynucleotide and polypeptide 
embodiments of the invention. 
20 Table 2 shows the GenBank identification number and annotation of the nearest GenBank 

homolog, and the PROTEOME database identification numbers and annotations of PROTEOME 
database homologs, for polypeptide embodiments of the invention. The probability scores for the 
matches between each polypeptide and its homolog(s) are also shown. 

Table, 3 shows structural features of polypeptide embodiments, including predicted motifs and 
25 domains, along with the methods, algorithms, and searchable databases used for analysis of the 
polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide embodiments, along with selected fragments of the polynucleotides. 

Table 5 shows representative cDNA libraries for polynucleotide embodiments. 
30 Table 6 provides an appendix which describes the tissues and vectors used for construction of 

the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and 
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polypeptides, along with applicable descriptions, references, and threshold parameters. 

Table 8 shows single nucleotide polymorphisms found in polynucleotide sequences of the 
invention, along with allele frequencies in different human populations. 

5 DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleic acids, and methods are described, it is understood that 
embodiments of the invention are not limited to the particular machines, instruments, materials, and 
methods described, as these may vary. It is also to be understood that the terminology used herein is 
for the purpose of describing particular embodiments only, and is not intended to limit the scope of the 
10 invention. 

As used herein and in the appended claims, the singular forms "a," "an," and "the" include 
plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a 
host cell" includes a plurality of such host cells, and a reference to "an antibody" is a reference to one 
or more antibodies and equivalents thereof known to those skilled in the art, and so forth. 

15 Unless defined otherwise, all technical and scientific terms used herein have the same 

meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be. 
used to practice or test the present invention, the preferred machines, materials and methods are now ., 
described. All publications mentioned herein are cited for the purpose of describing and disclosing the 

20 cell lines, protocols, reagents and vectors which are reported in the publications and which might be 
used in connection with various embodiments of the invention. Nothing herein is to be construed as an 
admission that the invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

'TSTTRAN" refers to the amino acid sequences of substantially purified NTRAN obtained 
25 from any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, 
and human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
NTRAN. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of NTRAN either by directly interacting with 
30 NTRAN or by acting on components of the biological pathway in which NTRAN participates. 

An "allelic variant" is an alternative form of the gene encoding NTRAN. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
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polypeptides whose structure or function may or may not he altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 

5 in a given sequence. 

"Altered" nucleic acid sequences encoding NTRAN include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as NTRAN or a 
polypeptide with at least one functional characteristic of NTRAN. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 

10 the polynucleotide encoding NTRAN, and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide encoding NTRAN. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 4 
acid residues which produce a silent change and result in a functionally equivalent NTRAN. 
Deliberate amino acid substitutions maybe made on the basis of one or more similarities in polarity, 

15 charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long 
as the biological or immunological activity of NTRAN is retained. For example, negatively charged 
amino acids may include aspartic acid and glutamic acid, and positively charged amino acids may 
include lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity 
values may include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged 

20 side chains having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and 
alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" can refer to an oligopeptide, a peptide, a 
polypeptide, or a protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 

25 occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid. Amplification 
may be carried out using polymerase chain reaction (PGR) technologies or other nucleic acid 
amplification technologies well known in the art. 

30 The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 

of NTRAN. Antagonists may include proteins such as antibodies, anticalins, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the activity of 
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NTRAN either by directly interacting with NTRAN or by acting on components of the biological 

pathway in which NTRAN participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 

thereof, such as Fab, F(aV) 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
5 Antibodies that bind NTRAN polypeptides can be prepared using intact polypeptides or using 

fragments containing small peptides of interest as the immunizing antigen. The polypeptide or 

oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the 

translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. 

Commonly used carriers that are chemically coupled to peptides include bovine serum albumin, 
10 thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize 

the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 

15 which bind specifically to antigenic determinants (particular regions or three-dimensional structures on 
the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used 
to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 
specific molecular target Aptamers are derived from an in vitro evolutionary process (e.g., SELEX 

20 (Systematic Evolution of Ligands by Exponential Enrichment), described in U.S . Patent No. 

5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions maybe double-stranded or single-stranded, and may include 
deoxyribonucleotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. The 
nucleotide components of an aptamer may have modified sugar groups (e.g., the 2-OH group of a 

25 ribonucleotide may be replaced by 2'-F or 2-NEy , which may improve a desired property, e.g., 

resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g., a high molecular weight carrier to slow clearance of the aptamer from the circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 
cross-linker (Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13). 

30 The term "intramer" refers to an aptamer which is expressed in vivo. For example, a 

vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at 
higji levels in the cytoplasm of leukocytes (Blind, M. et aL (1999) Proc. Natl. Acad. Sci. USA 
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96:3606-3610). 

The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 
handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 

5 substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a polynucleotide having a specific nucleic acid sequence. Antisense compositions 
may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone 
linkages such as phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides 

10 having modified sugar groups such as 2 -methoxyethyl sugars or 2-methoxyethoxy sugars; or 
oligonucleotides having modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2- 
deoxyguanosine. Antisense molecules may be produced by any method including chemical synthesis 
or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a 
naturally occurring nucleic acid sequence produced by the cell to form duplexes which block either 

15 transcription or translation. The designation "negative" or "minus" can refer to the antisense strand, 
and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term 'biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic NTRAN, or of any oligopeptide 

20 thereof, to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary' describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 
3'-TCA-5\ 

25 A "composition comprising a given polynucleotide" and a "composition comprising a given 

polypeptide" can refer to any composition containing the given polynucleotide or polypeptide. The 
composition may comprise a dry formulation or an aqueous solution. Compositions comprising 
polynucleotides encoding NTRAN or fragments of NTRAN maybe employed as hybridization probes. 
The probes may be stored in freeze-dried form and may be associated with a stabilizing agent such as 

30 a carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts 
(e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardfs 
solution, dry milk, salmon sperm DNA, etc.). 
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"Consensus sequence" refers to a nucleic acid sequence which has been subjected to 
repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 

5 program for fragment assembly, such as the GELVEEW fragment assembly system (Accelrys, 

Burlington MA) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 

10 protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which maybe substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 


15 


Arg 


HQs, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Ghi, His 


20 


Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




He 


Leu, Val 




Leu 


He, Val 


25 


Lys 


Arg, Gin, Glu 




Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 


30 


Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 




Val 


lie, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
35 backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 
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The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alky], acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 

5 one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 

10 absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 
exon may represent a structural or functional domain of the encoded protein, new proteins maybe 

15 assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of NTRAN or a polynucleotide encoding NTRAN which 
can be identical in sequence to, but shorter in length than, the parent sequence. A fragment may 
comprise up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. 

20 For example, a fragment may comprise from about 5 to about 1000 contiguous nucleotides or amino 
acid residues. A fragment used as a probe, primer, antigen, therapeutic molecule, or for other 
purposes, maybe at least 5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 
contiguous nucleotides or amino acid residues in length. Fragments maybe preferentially selected 
from certain regions of a molecule. For example, a polypeptide fragment may comprise a certain 

25 length of contiguous amino acids selected from the first 250 or 500 amino acids (or first 25% or 50%) 
of a polypeptide as shown in a certain defined sequence. Clearly these lengths are exemplary, and 
any length that is supported by the specification, including the Sequence Listing, tables, and figures, 
may be encompassed by the present embodiments. 

A fragment of SEQ ID NO:23-44 can comprise a region of unique polynucleotide sequence 

30 that specifically identifies SEQ ID NO:23-44, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:23-44 can be employed 
in one or more embodiments of methods of the invention, for example, in hybridization and 
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amplification technologies and in analogous methods that distinguish. SEQ ID NO:23-44 from related 
polynucleotides. The precise length of a fragment of SEQ ID NO:23-44 and the region of SEQ ID 
NO:23-44 to which the fragment corresponds are routinely determinable by one of ordinary skill in the 
art based on the intended purpose for the fragment. 
5 A fragment of SEQ ID NO: 1-22 is encoded by a fragment of SEQ ID NO:23-44. A 

fragment of SEQ ID NO: 1-22 can comprise a region of unique amino acid sequence that specifically 
identifies SEQ ID NO:l-22. For example, a fragment of SEQ ID NO:l-22 can be used as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO: 1-22. 
The precise length of a fragment of SEQ ID NO:l-22 and the region of SEQ ID NO:l-22 to which 
10 the fragment corresponds can be determined based on the intended purpose for the fragment using 
one or more analytical methods described herein or otherwise known in the art. 

A "fall length" polynucleotide is one containing at least a translation initiation codon (e.g., 
methionine) followed by an open reading frame and a translation termination codon. A "full length" 
polynucleotide sequence encodes a "full length" polypeptide sequence. 
15 "Homology" refers to sequence similarity or, alternatively, sequence identity, between two or 

more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of identical residue matches between at least two polynucleotide sequences aligned 
using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, 
20 gaps in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using one or more 
computer algorithms or programs known in the art or described herein. For example, percent identity 
can be determined using the default parameters of the CLUSTAL V algorithm as incorporated into 
25 the MEGAUGN version 3 . 12e sequence alignment program. This program is part of the 

LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, 
Madison WI). CLUSTAL V is described in Higgins, D.G. and P.M. Sharp (1989; CABIOS 5:151- 
153) and in Higgins, D.G. et al. (1992; CABIOS 8:189-191). For pairwise alignments of 
polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty^5, 
30 window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default 
Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
which can be used is provided by the National Center for Biotechnology Information (NCBI) Basic 



41 



WO 03/051902 



PCT/US02/40059 



Local Alignment Search Tool (BLAST) (Altschul, S.R et al. (1990) J. MoL Biol. 215:403-410), which 
is available from several sources, including the NCBI, Bethesda, MD, and on the Internet at 
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
5 polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
10 compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters maybe, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 
15 Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 

20 Percent identity may be measured over the length of an entire defined sequence, for example, 

as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 

25 by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 

30 sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of identical residue matches between at least two polypeptide sequences aligned using 
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a standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. The 
5 phrases "percent similarity 7 ' and "% similarity," as applied to polypeptide sequences, refer to the 
percentage of residue matches, including identical residue matches and conservative substitutions, 
between at least two polypeptide sequences aligned using a standardized algorithm. In contrast, 
conservative substitutions are not included in the calculation of percent identity between polypeptide 
sequences. 

10 Percent identity between polypeptide sequences may be determined using the default 

parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved' =5. The PAM250 matrix is selected as the default 
1 5 residue weight table. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the ce BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) wifhblastp set at default parameters. Such default parameters maybe, for 
example: 
20 Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 3 
25 Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ED number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
30 150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 
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4 'Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
5 sequence in the non- antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

''Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
10 Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the '^washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in 
15 the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 /ig/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
20 under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. and D.W. 
25 Russell (2001; Molecular Cloning: A Laboratory Manual , 3rd ed., vol. 1-3, Cold Spring Harbor Press, 
Cold Spring Harbor NY, ch. 9). 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C maybe used. SSC concentration may 
30 be varied from about 0. 1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 jig/ml. Organic solvent, such as 
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formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNArDNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 

5 strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acids by 
virtue of the formation of hydrogen bonds between complementary bases. A hybridization complex 
may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one nucleic acid present in 
solution and another nucleic acid immobilized on a solid support (e.g., paper, membranes, filters, chips, 

10 pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have been 
fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or polynucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 
"Immune response" can refer to conditions associated with inflammation, trauma, immune 
15 disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of NTRAN which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
20 mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment • 
of NTRAN which is useful in any of the antibody production methods disclosed herein or known in the 
art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, 

polypeptides, antibodies, or other chemical compounds on a substrate. 
25 The terms "element" and "array element" refer to a polynucleotide, polypeptide, antibody, or 

other chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of NTRAN. For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 

functional, or immunological properties of NTRAN. 
30 The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 

polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 

synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
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antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 

5 sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

'Teptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 

10 ' preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an NTRAN may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary 

15 by cell type depending on the enzymatic milieu of NTRAN. 

"Probe" refers to nucleic acids encoding NTRAN, their complements, or fragments thereof, 
which are used to detect identical, allelic or related nucleic acids. Probes are isolated oligonucleotides 
or polynucleotides attached to a detectable label or reporter molecule. Typical labels include 
radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are short nucleic 

20 acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 

complementary base-pairing. The primer may then be extended along the target DNA strand by a 
DNA polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic 
acid, e.g., by the polymerase chain reaction (PGR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 

25 nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, maybe used. 

30 Methods for preparing and using probes and primers are described in, for example, Sambrook, 

J. and D.W. Russell (2001; Molecular Cloning: A Laboratory Manual . 3rd ed., vol 1-3, Cold Spring 
Harbor Press, Cold Spring Harbor NY), Ausubel, F.M. et aL (1999; Short Protocols in Molecular 
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Biology , 4 th ed., John Wiley & Sons, New York NY), and Innis, M. et aL (1990; PCR Protocols, A 
Guide to Methods and Applications , Academic Press, San Diego CA). PCR primer pairs can he 
derived from a known sequence, for example, hy using computer programs intended for that purpose 
such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 

5 Oligonucleotides for use as primers are selected using software known in the art for such 

purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 

10 primer selection program (available to the public from the Genome Center at University of Texas 
South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer 
selection program (available to the public from the Whitehead Institute/MIT Center for Genome 
Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to 

15 avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The . 
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource 
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 

20 selection of primers that hybridize to either the most conserved or least conserved regions of aligned 
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 
identified by any of the above selection methods are useful in hybridization technologies, for example, 
as PCR or sequencing primers, microairay elements, or specific probes to identify fully or partially 

25 complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are 
not limited to those described above. 

A "recombinant nucleic acid" is a nucleic acid that is not naturally occurring or has a 
sequence that is made by an artificial combination of two or more otherwise separated segments of 
sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, 

30 by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering 

techniques such as those described in Sambrook and Russell (supra). The term recombinant includes 
nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the 
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nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably 
linked to a promoter sequence. Such a recombinant nucleic acid may be part of a vector that is used, 
for example, to transform a cell. 

Alternatively, such recombinant nucleic acids maybe part of a viral vector, e.g., based on a 
5 vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
10 translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 
15 An "RNA equivalent," in reference to a DNA molecule, is composed of the same linear 

sequence of nucleotides as the reference DNA molecule with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar "backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing NTRAN, 
20 nucleic acids encoding NTRAN, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, 
in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
25 synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the 
epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A and the 
antibody will reduce the amount of labeled A that binds to the antibody. 
30 The term "substantially purified" refers to nucleic acid or amino acid sequences that are 

removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably at least about 75% free, and most preferably at least about 90% free from other 
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components with which they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 

5 chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" or "expression profile" refers to the collective pattern of gene expression 
by a particular cell type or tissue under given conditions at a given time. 

10 "Transformation" describes a process by which exogenous DNA is introduced into a recipient 

cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 
on the type of host cell being transformed and may include, but is not limited to, bacteriophage or viral 

15 infection, electroporation, heat shock, lipofection, and particle bombardment. The term "transformed 
cells" includes stably transformed cells in which the inserted DNA is capable of replication either as 
an autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to animals 

20 and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 

introduced by way of human intervention, such as by transgenic techniques well known in the art. The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 
recombinant virus. In another embodiment, the nucleic acid can be introduced by infection with a 

25 recombinant viral vector, such as a lentiviral vector (Lois, C et aL (2002) Science 295:868-872). The 
term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather 
is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 
contemplated in accordance with the present invention include bacteria, cyanobacteria, fungi, plants 
and animals. The isolated DNA of the present invention can be introduced into the host by methods 

30 known in the art, for example infection, transfection, transformation or transconjugation. Techniques 
for transferring the DNA of the present invention into such organisms are widely known and provided 
in references such as Sambrook and Russell (supra). 
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A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 

5 least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice " "species," or "polymorphic" variant. A splice variant may have 
significant identity to a reference molecule, but will generally have a greater or lesser number of 

10 polynucleotides due to alternate splicing of exons during niRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotides that vary from one species to another. The 
resulting polypeptides will generally have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in tiie polynucleotide sequence of a particular gene between 

15 individuals of a given species. Polymorphic variants also may encompass "single nucleotide 

polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 

20 at least 40% sequence identity or sequence similarity to the particular polypeptide sequence over a 
certain length of one of the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool 
Version 2.0.9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for 
example, at least 50%, at least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, 

25 or at least 99% or greater sequence identity or sequence similarity over a certain defined length of one 
of the polypeptides. 

THE INVENTION 

Various embodiments of the invention include new human neurotransmission-associated 
30 proteins (NTRAN), the polynucleotides encoding NTRAN, and the use of these compositions for the 
diagnosis, treatment, or prevention of autoimmune/inflammatory, cardiovascular, neurological, 
developmental, cell proliferative, transport, psychiatric, metabolic, and endocrine disorders. 
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Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
embodiments of the invention. Each polynucleotide and its corresponding polypeptide are correlated to 
a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an 

5 Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide 

sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID 
NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 
Column 6 shows the Incyte ID numbers of physical, full length clones corresponding to the polypeptide 
and polynucleotide sequences of the invention. The full length clones encode polypeptides which have 

10 at least 95% sequence identity to the polypeptide sequences shown in column 3. 

Table 2 shows sequences with homology to polypeptide embodiments of the invention as 
identified by BLAST analysis against the GenBank protein (genpept) database and the PROTEOME 
database. Columns 1 and 2 show the polypeptide sequence identification number (Polypeptide SEQ 
ID NO:) and the corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for 

15 polypeptides of the invention. Column 3 shows the GenBank identification number (GenBank ID NO:) 
of the nearest GenBank homolog and the PROTEOME database identification numbers 
(PROTEOME ID NO:) of the nearest PROTEOME database homologs. Column 4 shows the 
probability scores for the matches between each polypeptide and its homolog(s). Column 5 shows the 
annotation of the GenBank and PROTEOME database homolog(s) along with relevant citations where 

20 applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 and 

2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column 

3 shows the number of amino acid residues in each polypeptide. Column 4 shows amino acid residues 
25 comprising signature sequences, domains, motifs, potential phosphorylation sites, and potential 

gjycosylation sites. Column 5 shows analytical methods for protein structure/function analysis and in 
some cases, searchable databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are neurotransmission-associated proteins. For 
30 example, For example, SEQ ID NO:5 is 68% identical, from residue E43 to residue D287, and is 60% 
identical, from residue Ml to G118 to murine gliacolin (GenBank ID gl4278927) as determined by the 
Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability scores are 
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3.0e-86 and 5.6e-30, which indicate the probabilities of obtaining the observed polypeptide sequence 
alignments by chance. As determined by BLAST analysis using the PROTEOME database, SEQ ED 
NO:5 also has homology to murine and human Clq-related factors which have a collagenous region 
and a globular domain and have similarity to the Clq signature domain. Both murine and human 

5 Clq-related factors are highly expressed in brain areas that are involved in motor function 

(PROTEOME ID 429678|Clqrf and 567880|CRF). SEQ ID NO:5 also contains a Clq domain and a 
collagen triple helix repeat (20 copies) domain as determined by searching for statistically significant 
matches in the hidden Markov model (HMM)-based PFAM database of conserved protein family 
domains. (See Table 3.) Data from BLIMPS, MOTIFS, PROEELESCAN, and additional BLAST 

10 analyses provide further corroborative evidence that SEQ ID NO:5 is a Clq-related factor. 

In an alternative example, SEQ ID NO:9 is 86% identical, from residue Ml to residue F1308, 
to Mus musculus cell recognition molecule CASPR4 (GenBank ID gl2330704) as determined by the 
Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, 
which indicates the probability of obtaining the observed polypeptide sequence alignment by chance. 

15 SEQ ID NO:9 also has homology to Neurexin 4 (contactin associated protein 1), a neuronal paranodal 
transmembrane receptor containing epidermal growth factor-like and neurexin motifs, as deteimined 
by BLAST analysis using the PROTEOME database. SEQ ID NO:9 also contains an EGF-like 
domain and a F5/8 type C domain as determined by searching for statistically significant matches in . 
the hidden Markov model (HMM)-based PFAM database of conserved protein family domains. (See 

20 Table 3.) Data from BLIMPS and MOTIFS analyses provide further corroborative evidence that 
SEQ ID NO:9 is a cell recognition molecule. 

In an alternative example, SEQ ID NO:13 is 100% identical, from residue Ml to residue S573, 
to human myelin-associated glycoprotein precursor (GenBank ID g307156) as determined by the 
Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 0.0, 

25 which indicates the probability of obtaining the observed polypeptide sequence alignment by chance. 
SEQ ID NO: 13 also has homology to proteins that are localized to the nervous system, are members 
of the sialoadhesin subgroup of immunoglobulin superfamily lectins, and are cell adhesion molecules, as 
determined by BLAST analysis using the PROTEOME database. SEQ ID NO:13 also contains an I- 
type IG domain, an Immunoglobulin domain, an Ig superfamily from SCOP domain and an 

30 Immunoglobulin C-2 type domain as determined by searching for statistically significant matches in 
the hidden Markov model (HMM)-based PFAM, INCY and SMART databases of conserved protein 
families/domains. (See Table 3.) Data from BLIMPS, MOTIFS, and additional BLAST analyses 



52 



WO 03/051902 



PCT/US02/40059 



provide further corroborative evidence that SEQ ID NO: 13 is a myelin cell adhesion molecule. 

In an alternative example, SEQ ID NO: 16 is 93% identical, from residue A32 to residue S412, 
to human ubiquilin (GenBankID gl8254511) as determined by the Basic Local Alignment Search Tool 
(BLAST). (See Table 2.) The BLAST probability score is 4.6e-185, which indicates the probability 

5 of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 16 also has 
homology to proteins that are localized to the endoplasmic reticulum, nucleus, and cytoplasm, is 
associated with the 26S proteasome, binds to and promotes the accumulation of presenilin 1 and 
presenilin 2, and are localized to neurofibrillary tangles and Lewy bodies in brains affected by 
Alzheimer's disease and Parkinson's disease, and are annotated as ubiquilin 1 proteins, as determined 

10 by BLAST analysis using the PROTEOME database. SEQ ID NO:16 also contains UBA/TS-N and 
ubiquitin associated domains as determined by searching for statistically significant matches in the 
hidden Markov model (HMM)-based PFAM and SMART database of conserved protein 
families/domains. (See Table 3.) Data from BLAST analyses against the PRODOM database, 
provide further corroborative evidence that SEQ ID NO: 16 is a ubiquilin protein. 

15 In an alternative example, SEQ ID NO: 19 is 89% identical, from residue Ml to residue L621, 

to Mus musculus punc (g3068592) as determined by the Basic Local Alignment Search Tool 
(BLAST). The BLAST probability score is 0.0, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO:19 also has homology to proteins . 
that are localized to the plasma membrane, are members of the immunoglobulin superfamily, and are 

20 neuronal cell adhesion proteins, as determined by BLAST analysis using the PROTEOME database. 
SEQ ID NO: 19 also contains fibronectin type 3 and immunoglobulin domains as determined by 
searching for statistically significant matches in the hidden Markov model (HMM)-based PFAM, 
INCY, and SMART databases of conserved protein families/domains. (See Table 3.) Data from 
BLAST analyses against the PRODOM and DOMO databases, provide further corroborative 

25 evidence that SEQ ID NO:19 is a neuronal cell adhesion protein. 

SEQ ID NO:l-4, SEQ ID NO:6-8, SEQ ID NO:10-12, SEQ ID NO:14-15, SEQ ID NO:17- 
18, and SEQ ID NO:20-22 were analyzed and annotated in a similar manner. The algorithms and 
parameters for the analysis of SEQ ID NO: 1-22 are described in Table 7. 

As shown in Table 4, the full length polynucleotide embodiments were assembled using cDNA 

30 sequences or coding (exon) sequences derived from genomic DNA, or any combination of these two 
types of sequences. Column 1 lists the polynucleotide sequence identification number (Polynucleotide 
SEQ ID NO:), the corresponding Incyte polynucleotide consensus sequence number (Incyte ID) for 
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each polynucleotide of the invention, and the length of each polynucleotide sequence in basepairs. 
Column 2 shows the nucleotide start (5') and stop (3') positions of the cDNA and/or genomic 
sequences used to assemble the full length polynucleotide embodiments, and of fragments of the 
polynucleotides which are useful, for example, in hybridization or amplification technologies that 

5 identify SEQ ID NO:23-44 or that distinguish between SEQ ID NO:23-44 and related polynucleotides. 
The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for 
example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 
libraries. Alternatively, the polynucleotide fragments described in column 2 may refer to GenBank 
cDNAs or ESTs which contributed to the assembly of the foil length polynucleotides. In addition, the 

10 polynucleotide fragments described in column 2 may identify sequences derived from the ENSEMBL 
(The Sanger Centre, Cambridge, UK) database (i.e., those sequences including the designation 
"ENST"). Alternatively, the polynucleotide fragments described in column 2 maybe derived from the 
NCBI RefSeq Nucleotide Sequence Records Database (jLe. , those sequences including the 
designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records those sequences 

15 including the designation <£ NF'). Alternatively, the polynucleotide fragments described in column 2 
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon 
stitching" algorithm. For example, a polynucleotide sequence identified as 
m,JOOOOQCJi 1 _N 2 _YYYYY_N 3 _N 4 represents a "stitched'* sequence in which XXXXXX is the, 
identification number of the cluster of sequences to which the algorithm was applied, and YYYYY is the 

20 number of the prediction generated by the algorithm, and N It2 ^ 9 if present, represent specific exons 
that may have been manually edited during analysis (See Example V). Alternatively, the 
polynucleotide fragments in column 2 may refer to assemblages of exons brought together by an 
"exon-stxetching" algorithm. For example, a polynucleotide sequence identified as 
FLXXXXXX_gAAAAA^BBBBJ._N is a "stretched" sequence, with XXXXXX being the Incyte 

25 project identification number, gAAAAA being the GenBank identification number of the human 

genomic sequence to which the "exon-stretching" algorithm was applied, gfiBBBB being the GenBank 
identification number or NCBI RefSeq identification number of the nearest GenBank protein homolog, 
and N referring to specific exons (See Example V). In instances where a RefSeq sequence was used 
as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier (denoted by "NM," 

30 "NP," or "NT") may be used in place of the GenBank identifier (i. e. , gfiBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 
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following Table lists examples of component sequence prefixes and corresponding sequence analysis 
methods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK) 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 


INCY 


Full length transcript and exon prediction from mapping of EST 
sequences to the genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript. 



10 In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 

Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte 
cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotides which 
were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte 

15 cDNA library which is most frequently represented by the Incyte cDNA sequences which were used 
to assemble and confirm the above polynucleotides. The tissues and vectors which were used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

Table 8 shows single nucleotide polymorphisms (SNPs) found in polynucleotide sequences of 
the invention, along with allele frequencies in different human populations. Columns 1 and 2 show the 

20 polynucleotide sequence identification number (SEQ ID NO:) and the corresponding Incyte project 
identification number (PID) for polynucleotides of the invention. Column 3 shows the Incyte 
identification number for the EST in which the SNP was detected (EST ID), and column 4 shows the 
identification number for the SNP (SNP ID). Column 5 shows the position within the EST sequence 
at which the SNP is located (EST SNP), and column 6 shows the position of the SNP within the full- 

25 length polynucleotide sequence (CB1 SNP). Column 7 shows the allele found in the EST sequence. 
Columns 8 and 9 show the two alleles found at the SNP site. Column 10 shows the amino acid 
encoded by the codon including the SNP site, based upon the allele found in the EST. Columns 11-14 
show the frequency of allele 1 in four different human populations. An entry of n/d (not detected) 
indicates that the frequency of allele 1 in the population was too low to be detected, while n/a (not 
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available) indicates that the allele frequency was not determined for the population. 

The invention also encompasses NTRAN variants. Various embodiments of NTRAN 
variants can have at least about 80%, at least about 90%, or at least about 95% amino acid sequence 
identity to the NTRAN amino acid sequence, and can contain at least one functional or structural 

5 characteristic of NTRAN. 

Various embodiments also encompass polynucleotides which encode NTRAN. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:23-44, which encodes NTRAN. The polynucleotide 
sequences of SEQ ID NO:23-44, as presented in the Sequence Listing, embrace the equivalent RNA 

10 sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses variants of a polynucleotide encoding NTRAN. In particular, 
such a variant polynucleotide will have at least about 70%, or alternatively at least about 85%, or even 
at least about 95% polynucleotide sequence identity to a polynucleotide encoding NTRAN. A 

15 particular aspect of the invention encompasses a variant of a polynucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:23-44 which has at least about 70%, or alternatively 
at least about 85%, or even at least about 95% polynucleotide sequence identity to a nucleic acid 
sequence selected from the group consisting of SEQ ID NO:23-44. Any one of the polynucleotide 
variants described above can encode a polypeptide which contains at least one functional or structural 

20 characteristic of NTRAN. 

In addition, or in the alternative, a polynucleotide variant of the invention is a splice variant of a 
polynucleotide encoding NTRAN. A splice variant may have portions which have significant 
sequence identity to a polynucleotide encoding NTRAN, but will generally have a greater or lesser 
number of polynucleotides due to additions or deletions of blocks of sequence arising from alternate 

25 splicing of exons during mRNA processing. A splice variant may have less than about 70%, or 
alternatively less than about 60%, or alternatively less than ahout 50% polynucleotide sequence 
identity to a polynucleotide encoding NTRAN over its entire length; however, portions of the splice 
variant will have at least about 70%, or alternatively at least about 85%, or alternatively at least about 
95%, or alternatively 100% polynucleotide sequence identity to portions of the polynucleotide encoding 

30 NTRAN. For example, a polynucleotide comprising a sequence of SEQ ID NO:33 and a 

polynucleotide comprising a sequence of SEQ ID NO:34 are splice variants of each other. Any one 
of the splice variants described above can encode a polypeptide which contains at least one functional 
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or structural characteristic of NTRAN. 

It wifl be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding NTRAN, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, maybe 
5 produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring NTRAN, and all such variations are to be considered 
as being specifically disclosed. 
10 Although polynucleotides which encode NTRAN and its variants are generally capable of 

hybridizing to polynucleotides encoding naturally occurring NTRAN under appropriately selected 
conditions of stringency, it maybe advantageous to produce polynucleotides encoding NTRAN or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 
15 particular prokaryotic or eukaryotic host in accordance with the frequency with which particular 
codons are utilized by the host. Other reasons for substantially altering the nucleotide sequence 
encoding NTRAN and its derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a greater half-life, than . 
transcripts produced from the naturally occurring sequence. 
20 The invention also encompasses production of polynucleotides which encode NTRAN and 

NTRAN derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic polynucleotide maybe inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry maybe used to 
introduce mutations into a polynucleotide encoding NTRAN or any fragment thereof. 
25 Embodiments of the invention can also include polynucleotides that are capable of hybridizing 

to the claimed polynucleotides, and, in particular, to those having the sequences shown in SEQ ID 
NO:23-44 and fragments thereof, under various conditions of stringency (Wahl, G.M. and S.L. Berger 
(1987) Methods EnzymoL 152:399-407; Kimmel, A.R. (1987) Methods EnzymoL 152:507-511). 
Hybridization conditions, including annealing and wash conditions, are described in 'Ttefinitions." 
30 Methods for DNA sequencing are well known in the art and may be used to practice any of 

the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
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Biosystems), thermostable T7 polymerase (Amersham Biosciences, Piscataway NJ), or combinations 
of polymerases and proofreading exonucleases such as those found in the ELONGASE amplification 
system (Invitrogen, Carlsbad CA). Preferably, sequence preparation is automated with machines 
such as the NUCROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler 

5 (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Applied Biosystems). 
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Applied 
Biosystems), the MEGABACE 1000 DNA sequencing system (Amersham Biosciences), or other 
systems known in the art. The resulting sequences are analyzed using a variety of algorithms which 
are well known in the art (Ausubel et al, supra, ch. 7; Meyers, R.A. (1995) Molecular Biology and 

10 Biotechnology , Wiley VCH, New York NY, pp. 856-853). 

The nucleic acids encoding NTRAN maybe extended utilizing a partial nucleotide sequence 
and employing various PCR-based methods known in the art to detect upstream sequences, such as 
promoters and regulatory elements. For example, one method which may be employed, restriction-site 
PCR, uses universal and nested primers to amplify unknown sequence from genomic DNA within a 

15 cloning vector (Sarkar, G. (1993) PCR Methods Applic. 2:318-322). Another method, inverse PCR, 
uses primers that extend in divergent directions to amplify unknown sequence from a circularized 
template. The template is derived from restriction fragments comprising a known genomic locus and 
surrounding sequences (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186)., A third method, capture 
PCR, involves PCR amplification of DNA fragments adjacent to known sequences in human and 

20 yeast artificial chromosome DNA (Lagerstrom, M. et al. (199 1) PCR Methods Applic. 1:11 1-1 19). In 
this method, multiple restriction enzyme digestions and ligations maybe used to insert an engineered 
double-stranded sequence into a region of unknown sequence before performing PCR. Other 
methods which may be used to retrieve unknown sequences are known in the art (Parker, J.D. et al. 
(1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers, and 

25 PROMOTERFINDER libraries (Clontech, Palo Alto CA) to walk genomic DNA. This procedure 
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based 
methods, primers maybe designed using commercially available software, such as OLIGO 4.06 
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be 
about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 

30 template at temperatures of about 68°C to 72°C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
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sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 

library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 

into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available maybe used to analyze 
5 the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 

sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 

specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 

emitted wavelengths. Output/light intensity maybe converted to electrical signal using appropriate 

software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
10 process from loading of samples to computer analysis and electronic data display maybe computer 

controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 

which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotides or fragments thereof which encode 

NTRAN may be cloned in recombinant DNA molecules that direct expression of NTRAN, or 
15 fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 

of the genetic code, other polynucleotides which encode substantially the same or a functionally 

; equivalent polypeptides may be produced and used to express NTRAN. 

The polynucleotides of the invention can be engineered using methods generally known in the 

art in order to alter NTRAN-encoding sequences for a variety of purposes including, but not limited to, 
20 modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by 

random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be 

i 

used to engineer the nucleotide sequences. For example, oligonucleotide-mediated site-directed 
mutagenesis maybe used to introduce mutations that create new restriction sites, alter glycosylation 
patterns, change codon preference, produce splice variants, and so forth. 

25 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5,837,458; Chang, C.-C. et al. (1999) Nat BiotechnoL 17:793-797; Christians, F.C. et aL (1999) Nat. 
BiotechnoL 17:259-264; and Crameri, A. et al. (1996) Nat. BiotechnoL 14:315-319) to alter or improve 
the biological properties of NTRAN, such as its biological or enzymatic activity or its ability to bind to 

30 other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. The library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
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preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until the desired properties are optimized. Alternatively, 

5 fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different species, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

In another embodiment, polynucleotides encoding NTRAN maybe synthesized, in whole or in 
part, using one or more chemical methods well known in the art (Caruthers, M.H. et al. (1980) 

10 Nucleic Acids Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232). 
Alternatively, NTRAN itself or a fragment thereof may be synthesized using chemical methods known 
in the art. For example, peptide synthesis can be performed using various solution-phase or 
solid-phase techniques (Creighton, T. (1984) Proteins, Structures and Molecular Properties , WH 
Freeman, New York NY, pp. 55-60; Roberge, J.Y. et aL (1995) Science 269:202-204). Automated 

15 synthesis may be achieved using the ABI 43 1 A peptide synthesizer (Applied Biosystems). 

Additionally, the amino acid sequence of NTRAN, or any part thereof, maybe altered during direct 
synthesis and/or combined with sequences from other proteins, or any part thereof, to produce a 
variant polypeptide or a polypeptide having a sequence of a naturally occurring polypeptide. 
The peptide may be substantially purified by preparative high performance liquid 

20 chromatography (Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421). The 
composition of the synthetic peptides maybe confirmed by amino acid analysis or by sequencing 
(Creighton, supra, pp. 28-53). 

In order to express a biologically active NTRAN, the polynucleotides encoding NTRAN or 
derivatives thereof maybe inserted into an appropriate expression vector, i.e., a vector which contains 

25 the necessary elements for transcriptional and translational control of the inserted coding sequence in 
a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotides encoding 
NTRAN. Such elements may vary in their strength and specificity. Specific initiation signals may 
also be used to achieve more efficient translation of polynucleotides encoding NTRAN. Such signals 

30 include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where a 
polynucleotide sequence encoding NTRAN and its initiation codon and upstream regulatory sequences 
are inserted into the appropriate expression vector, no additional transcriptional or translational control 
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signals maybe needed. However, in cases where only coding sequence, or a fragment thereof, is 
inserted, exogenous translational control signals including an in-frame ATG initiation codon should be 
provided by the vector. Exogenous translational elements and initiation codons may be of various 
origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 

5 enhancers appropriate for the particular host cell system used (Scharf, D. et aL (1994) Results Probl. 
Cell Differ. 20:125-162). 

Methods which are well known to those skilled in the art maybe used to construct expression 
vectors containing polynucleotides encoding NTRAN and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 

10 and in vivo genetic recombination (Sambrook and Russell, supra, ch. 1-4, and 8; Ausubel et aL, 
supra, ch. 1, 3, and 15). 

A variety of expression vector/host systems maybe utilized to contain and express 
polynucleotides encoding NTRAN. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; 

15 yeast transformed with yeast expression vectors; insect cell systems infected with viral expression 
vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., 
cauliflower mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors 
(e.g., Ti or pBR322 plasmids); or animal cell systems (Sambrook and Russell, supra; Ausubel et aL, 
supra; VanHeeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, RK. et aL 

20 (1994) Proc. NatL Acad. Sci. USA 91:3224-3227; Sandig, V. et aL (1996) Hum. Gene Ther. 7:1937- 
1945; Takamatsu, N. (1987) EMBO J. 6:307-311; The McGraw Hill Yearbook of Science and 
Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659; Harrington, J.J. et aL (1997) Nat. Genet. 15:345-355). 
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from 

25 various bacterial plasmids, may be used for delivery of polynucleotides to the targeted organ, tissue, or 
cell population pi Nicola, M. et al. (1998) Cancer Gen. Ther. 5:350-356; Yu, M. et aL (1993) Proc. 
Natl. Acad. Sci. USA 90:6340-6344; Buller, R.M. et al. (1985) Nature 317:813-815; McGregor, D.P. 
et al. (1994) Mol. Immunol. 31:219-226; Verma, LM. and N. Somia (1997) Nature 389:239-242). The 
invention is not limited by the host cell employed. 

30 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon the use intended for polynucleotides encoding NTRAN. For example, routine cloning, 
subcloning, and propagation of polynucleotides encoding NTRAN can be achieved using a 
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multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Invitrogen). Ligation of polynucleotides encoding NTRAN into the vector's multiple cloning 
site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 

5 in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence (VanHeeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509). When large quantities of NTRAN are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of NTRAN may be used. For example, vectors containing 
the strong, inducible SP6 or T7 bacteriophage promoter may be used. 

10 Yeast expression systems may be used for production of NTRAN. A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, maybe used in the yeast SaccharortTyces cerevisiae or Pichia pastoris. In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign polynucleotide sequences into the host genome for stable propagation (Ausubel et al., 

15 supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; Scorer, CA. et al. (1994) 
Bio/Technology 12:181-184). 

Plant systems may also be used for expression of NTRAN. Transcription of polynucleotides 
encoding NTRAN may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 

20 6:307-3 1 1). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters maybe used (Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) 
Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105). These constructs 
can be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection 
(The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 

25 191-196). 

In mammalian cells, a number of viral-based expression systems maybe utilized. In cases 
where an adenovirus is used as an expression vector, polynucleotides encoding NTRAN maybe 
ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 
30 obtain infective virus which expresses NTRAN in host cells (Logan, J. and T. Shenk (1984) Proc. 
Natl Acad. Sci. USA 81:3655-3659). In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EB V- 
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based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 

5 or vesicles) for therapeutic purposes (Harrington, J.J. et al. (1997) Nat Genet. 15:345-355). 

For long term production of recombinant proteins in mammalian systems, stable expression of 
NTRAN in cell lines is preferred. For example, polynucleotides encoding NTRAN can be 
transformed into cell lines using expression vectors which may contain viral origins of replication 
and/or endogenous expression elements and a selectable marker gene on the same or on a separate 

10 vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 days in 
enriched media before being switched to selective media. The purpose of the selectable marker is to 
confer resistance to a selective agent, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed cells maybe 
propagated using tissue culture techniques appropriate to the cell type. 

15 Any number of selection systems maybe used to recover transformed cell lines. These 

include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tkr and apr cells, respectively (Wigler, M. et al. (1977) 
Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823). Also, antimetabolite, antibiotic, or herbicide 
resistance can be used as the basis for selection. For example, dhfr confers resistance to 

20 methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and ah and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Wigler, M. et al. 
(1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 
150:1-14). Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular 
requirements for metabolites (Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 

25 85:8047-8051). Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), (}- 
glucuronidase and its substrate P-glucuronide, or luciferase and its substrate luciferin maybe used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system (Rhodes, C.A. (1995) 
Methods Mol. Biol. 55:121-131). 

30 Although the presence/absence of marker gene expression suggests that the gene of interest 

is also present, the presence and expression of the gene may need to be confirmed. For example, if 
the sequence encoding NTRAN is inserted within a marker gene sequence, transformed cells 



63 



WO 03/051902 



PCT/US02/40059 



containing polynucleotides encoding NTRAN can be identified by the absence of marker gene 
function. Alternatively, a marker gene can be placed in tandem with a sequence encoding NTRAN 
under the control of a single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

5 In general, host cells that contain the polynucleotide encoding NTRAN and that express 

NTRAN may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

10 Immunological methods for detecting and measuring the expression of NTRAN using either 

specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on NTRAN is preferred, but a 

15 competitive binding assay may be employed. These and other assays are well known in the art 

(Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual , APS Press, St Paul MN, Sect. 
IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley- 
Interscience, New York NY; Pound, J.D. (1998) Immunochemical Protocols , Humana Press, Totowa , 
NJ). 

20 A wide variety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding NTRAN include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, polynucleotides encoding NTRAN, or any fragments thereof, maybe cloned into a 

25 vector for the production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA 
polymerase such as T7, T3, or SP6 and labeled nucleotides. These procedures maybe conducted 
using a variety of commercially available kits, such as those provided by Amersham Biosciences, 
Promega (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be 

30 used for ease of detection include radionuclides, enzymes, fluorescent, chemihiminescent, or 
chromogenic agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 
Host cells transformed with polynucleotides encoding NTRAN may be cultured under 
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conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell maybe secreted or retained intraceflularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode NTRAN may be designed to contain signal sequences which direct 

5 secretion of NTRAN through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted polynucleotides or to process the expressed protein in the desired fashion. Such modifications 
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 

10 "pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WB8) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

15 In another embodiment of the invention, natural, modified, or recombinant polynucleotides 

encoding NTRAN may be ligated to a heterologous sequence resulting in translation of a fusion 
protein in any of the aforementioned host systems. For example, a chimeric NTRAN protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may , 
facilitate the screening of peptide libraries for inhibitors of NTRAN activity. Heterologous protein and 

20 peptide moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 
hemaggfrtinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 

25 respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
located between the NTRAN encoding sequence and the heterologous protein sequence, so that 
NTRAN may be cleaved away from the heterologous moiety following purification. Methods for 

30 fusion protein expression and purification are discussed in Ausubel et al. (supra, ch. 10 and 16). A 
variety of commercially available kits may also be used to facilitate expression and purification of 
fusion proteins. 
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In another embodiment, synthesis of radiolabeled NTRAN maybe achieved in viti'o using the 
TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple 
transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6 
promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
5 example, 35 S -methionine. 

NTRAN, fragments of NTRAN, or variants of NTRAN maybe used to screen for 
compounds that specifically bind to NTRAN. One or more test compounds maybe screened for 
specific binding to NTRAN. In various embodiments, 1, 2, 3, 4, 5, 10, 20, 50, 100, or 200 test 
compounds can be screened for specific binding to NTRAN. Examples of test compounds can 
10 include antibodies, anticalins, oligonucleotides, proteins (e.g., ligands or receptors), or small molecules. 
In related embodiments, variants of NTRAN can be used to screen for binding of test 
compounds, such as antibodies, to NTRAN, a variant of NTRAN, or a combination of NTRAN and/or 
one or more variants NTRAN. In an embodiment, a variant of NTRAN can be used to screen for 
compounds that bind to a variant of NTRAN, but not to NTRAN having the exact sequence of a 
15 sequence of SEQ ID NO:l-22. NTRAN variants used to perform such screening can have a range 
of about 50% to about 99% sequence identity to NTRAN, with various embodiments having 60%, 
70%, 75%, 80%, 85%, 90%, and 95% sequence identity. 

In an embodiment, a compound identified in a screen for specific binding to NTRAN can be : 
closely related to the natural ligand of NTRAN, e.g., a ligand or fragment thereof, a natural substrate, 
20 a structural or functional mimetic, or a natural binding partner (Coligan, J.E. et al. (1991) Current 
Protocols in TrnTTiimolop y l(2):Chapter 5). In another embodiment, the compound thus identified can 
be a natural ligand of a receptor NTRAN (Howard, A.D. et al. (2001) Trends Pharmacol. Sci.22:132- 
140; Wise, A. et al. (2002) Drug Discovery Today 7 :235-246). 

In other embodiments, a compound identified in a screen for specific binding to NTRAN can 
25 be closely related to the natural receptor to which NTRAN binds, at least a fragment of the receptor, 
or a fragment of the receptor including all or a portion of the ligand binding site or binding pocket. For 
example, the compound may be a receptor for NTRAN which is capable of propagating a signal, or a 
decoy receptor for NTRAN which is not capable of propagating a signal (Ashkenazi, A. and V.M. 
Divit (1999) Curr. Opm. Cell Biol. 11:255-260; Mantovani, A. et al. (2001) Trends Immunol. 22:328- 
30 336). The compound can be rationally designed using known techniques. Examples of such 
techniques include those used to construct the compound etanercept (ENBREL; Amgen Inc., 
Thousand Oaks CA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is 
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an engineered p75 tumor necrosis factor (TNF) receptor dimer linked to the Fc portion of human IgGi 
(Taylor, P.C. et al. (2001) Curr. Opin. Immunol. 13:611-616). 

In one embodiment, two or more antibodies having similar or, alternatively, different 
specificities can be screened for specific binding to NTRAN, fragments of NTRAN, or variants of 

5 NTRAN. The binding specificity of the antibodies thus screened can thereby be selected to identify 
particular fragments or variants of NTRAN. In one embodiment, an antibody can be selected such 
that its binding specificity allows for preferential identification of specific fragments or variants of 
NTRAN. In another embodiment, an antibody can be selected such that its binding specificity allows 
for preferential diagnosis of a specific disease or condition having increased, decreased, or otherwise 

10 abnormal production of NTRAN. 

In an embodiment, anticalins can be screened for specific binding to NTRAN, fragments of 
NTRAN, or variants of NTRAN. Anticalins are ligand-binding proteins that have been constructed 
based on a lipocalin scaffold (Weiss, G.A. and H.B. Lowman (2000) Chem. Biol. 7:R177-R184; 
Skerra, A. (2001) J. Biotechnol. 74:257-275). The protein architecture of lipocalins can include a 

15 beta-barrel having eight antiparallel beta-strands, which supports four loops at its open end. These 
loops form the natural ligand-binding site of the lipocalins, a site which can be re-engineered in vitro 
by amino acid substitutions to impart novel binding specificities. The amino acid substitutions can be 
made using methods known in the art or described herein, and can include conservative substitutions . 
(e.g., substitutions that do not alter binding specificity) or substitutions that modestly, moderately, or 

20 significantly alter binding specificity. 

In one embodiment, screening for compounds which specifically bind to, stimulate, or inhibit 
NTRAN involves producing appropriate cells which express NTRAN, either as a secreted protein or 
on the cell membrane. Preferred cells can include cells from mammals, yeast, Drosophila, or E. coli. 
Cells expressing NTRAN or cell membrane fractions which contain NTRAN are then contacted with 

25 a test compound and binding, stimulation, or inhibition of activity of either NTRAN or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fiuorophore, radioisotope, enzyme conjugate, or other detectable label. For example, the 
assay may comprise the steps of combining at least one test compound with NTRAN, either in 
30 solution or affixed to a solid support, and detecting the binding of NTRAN to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
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libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

An assay can be used to assess the ability of a compound to "bind to its natural ligand and/or to 
inhibit the binding of its natural ligand to its natural receptors. Examples of such assays include radio- 

5 labeling assays such as those described in U.S. Patent No. 5,914,236 and U.S. Patent No. 6,372,724. 
In a related embodiment, one or more amino acid substitutions can be introduced into a polypeptide 
compound (such as a receptor) to improve or alter its ability to bind to its natural ligands (Matthews, 
DJ. and J. A. Wells. (1994) Chem. BioL 1:25-30). In another related embodiment, one or more amino 
acid substitutions can be introduced into a polypeptide compound (such as a ligand) to improve or alter 

10 its ability to bind to its natural receptors (Cunningham, B.C. and J.A. Wells (1991) Proc. Natl. Acad. 
Sci. USA 88:3407-3411; Lowman, H.B. et al. (1991) J. Biol. Chem. 266:10982-10988). 

NTRAN, fragments of NTRAN, or variants of NTRAN may be used to screen for 
compounds that modulate the activity of NTRAN. Such compounds may include agonists, antagonists, 
or partial or inverse agonists. In one embodiment, an assay is performed under conditions permissive 

15 for NTRAN activity, wherein NTRAN is combined with at least one test compound, and the activity 
of NTRAN in the presence of a test compound is compared with the activity of NTRAN in the 
absence of the test compound. A change in the activity of NTRAN in the presence of the test 
compound is indicative of a compound that modulates the activity of NTRAN. Alternatively, a test 
compound is combined with an in vitro or cell-free system comprising NTRAN under conditions 

20 suitable for NTRAN activity, and the assay is performed. In either of these assays, a test compound 
which modulates the activity of NTRAN may do so indirectly and need not come in direct contact 
with the test compound. At least one and up to a plurality of test compounds maybe screened. 

In another embodiment, polynucleotides encoding NTRAN or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 

25 cells. Such techniques are well known in the art and are useful for the generation of animal models of 
human disease (see, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337). For example, 
mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo and 
grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted 
by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) Science 

30 244:1288-1292). The vector integrates into the corresponding region of the host genome by 

homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific maimer (Marfh, J.D. 
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(1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and 
the resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous 

5 strains. Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding NTRAN may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 

10 (1998) Science 282:1145-1147). 

Polynucleotides encoding NTRAN can also be used to create <c knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding NTRAN is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae 

15 are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, 
a mammal inbred to overexpress NTRAN, e.g., by secreting NTRAN in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

20 Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 

regions of NTRAN and neurotransmission-associated proteins. In addition, examples of tissues 
expressing NTRAN can be found in Table 6 and can also be found in Example XI. Therefore, 
NTRAN appears to play a role in autoimmune/inflammatory, cardiovascular, neurological, 
developmental, cell proliferative, transport, psychiatric, metabolic, and endocrine disorders. In the 

25 treatment of disorders associated with increased NTRAN expression or activity, it is desirable to 
decrease the expression or activity of NTRAN. In the treatment of disorders associated with 
decreased NTRAN expression or activity, it is desirable to increase the expression or activity of 
NTRAN. 

Therefore, in one embodiment, NTRAN or a fragment or derivative thereof may be 
30 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of NTRAN. Examples of such disorders include, but are not limited to, an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), Addison's 
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disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 

5 lymphopenia withlymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves* disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic 
sclerosis, thromb osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, 

10 rheumatoid arthritis, scleroderma, Sjogren's ocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fungal, parasitic, protozoal, and helminthic infections, and trauma; a cardiovascular disorder such as 
congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, hypertensive 
heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid 

15 aortic valve, mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart 
disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic hipus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart i 
disease, congenital heart disease, and complications of cardiac transplantation, arteriovenous fistula,, , 
atherosclerosis, hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose 

20 veins, thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, 
balloon angioplasty, vascular replacement, and coronary artery bypass graft surgery; a neurological 
disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 

25 retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases including 
kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial 
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous . 

30 sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorders of the central nervous system including Down syndrome, cerebral 
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
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diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 
seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 

5 dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a developmental 
disorder such as renal tubular acidosis, anemia, Dishing' s syndrome, achondroplastic dwarfism, 
Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms' 
tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, 

10 myelodysplastic syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary 
neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, 
hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, spina bifida, 
anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; a cell 
proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 

15 hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall : 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 

20 penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus and a cancer such as 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, 
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a transport disorder such as 

25 akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular 

dystrophy, BelTs palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, diabetic 
neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, nonnokalemic periodic 
paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis, 
myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral 

30 neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 
tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline myopathy, 
centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, ethanol 
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myopathy, dermatomyositis, inclusion body myositis, infections myositis, polymyositis, neurological 
disorders associated with transport, e.g., AMieimer's disease, amnesia, bipolar disorder, dementia, 
depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other disorders 
associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, 

5 sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery stenosis, 
sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's 
disease, Addison's disease, glucose-galactose malabsorption syndrome, hypercholesterolemia, 
adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a psychiatric disorder such 

10 as acute stress disorder, alcohol dependence, amphetamine dependence, anorexia nervosa, antisocial 
personality disorder, attention-deficit hyperactivity disorder, autistic disorder, anxiety, avoidant 
personality disorder, bipolar disorder, borderline personality disorder, brief psychotic disorder, bulimia 
nervosa, cannabis dependence, cocaine dependence, conduct disorder, cyclothymic disorder, delirium, 
delusional disorder, dementia, dependent personality disorder, depression, dysthymic disorder, 

15 hallucinogen dependence, histrionic personality disorder, inhalant dependence, manic depression, 
multi-infarct dementia, narcissistic personality disorder, nicotine dependence, obsessive-compulsive 
disorder, opioid dependence, oppositional defiant disorder, panic disorder, paranoid personality disorder, 
phencyclidine dependence, phobia, posttraumatic stress disorder, schizoaffective disorder, schizoid 
personality disorder, schizophrenia, sedative dependence, separation anxiety disorder, and sleep 

20 disorder; a metabolic disorder such as Addison's disease, cerebrotendinous xanthomatosis, congenital 
adrenal hyperplasia, coumariu resistance, cystic fibrosis, fatty hepatockrhosis, 

fructose- 1,6-diphosphatase deficiency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary fructose intolerance, hyperadrenalism, hypoadrenalism, hyperparathyroidism, 
hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 

25 hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases, 

mannosidosis, neuraminidase deficiency, obesity, osteoporosis, phenylketonuria, pseudovitamin D- 
deficiency rickets, disorders of carbohydrate metabolism such as congenital type II dyserythropoietic 
anemia, diabetes, insulin-dependent diabetes mellitus, non-insulin-dependent diabetes mellitus, 
galactose epimerase deficiency, glycogen storage diseases, lysosomal storage diseases, fructosuria, 

30 pentosuria, and inherited abnormalities of pyruvate metabolism, disorders of lipid metabolism such as 
fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase 
deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such 
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Fabry's disease, Gaucher's disease, Niemann-Pick's disease, metachromatic leukodystrophy, 
adrenoleukodystrophy, GM 2 gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier 
disease, hyperlipoproteinemia, lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, 
adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, 

5 hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary 

hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithinxholesterol 
acyhransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay- 
Sachs disease, SandhofPs disease, hyperlipidemia, hyperlipemia, and lipid myopathies, and disorders of 
copper metabolism such as Menke's disease, Wilson's disease, and Ehlers-Danlos syndrome type IX 

10 diabetes; and an endocrine disorder such as a disorder of the hypothalamus and/or pituitary resulting 
from lesions such as a primary brain tumor, adenoma, infarction associated with pregnancy, 
hypophysectomy, aneurysm, vascular malformation, thrombosis, infection, immunological disorder, and 
complication due to head trauma, a disorder associated with hypopituitarism including hypogonadism, 
Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-Schuller-Christian disease, Letterer- 

15 Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism, a disorder associated with 

hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone 
(ADH) secretion (SIADH) often caused by benign adenoma, a disorder associated with 
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and 

20 cretinism, a disorder associated with hyperthyroidism including thyrotoxicosis and its various forms, 
Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Plummer's 
disease, a disorder associated with hyperparathyroidism including Conn disease (chronic 
hypercalemia), a pancreatic disorder such as Type I or Type II diabetes mellitus and associated 
complications, a disorder associated with the adrenals such as hyperplasia, carcinoma, or adenoma of 

25 the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, Cushing's 
disease, Liddle's syndrome, and Arnold-Healy-Gordon syndrome, pheochromocytoma tumors, and 
Addison's disease, a disorder associated with gonadal steroid hormones such as: in women, abnormal 
prolactin production, infertility, endometriosis, perturbation of the menstrual cycle, polycystic ovarian 
disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, 

30 hermaphroditism, hirsutism and virilization, breast cancer, and, in post-menopausal women, 

osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell aplasia, a 
hypergonadal disorder associated with Leydig cell tumors, androgen resistance associated with 
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absence of androgen receptors, syndrome of 5 a-reductase, and gynecomastia. 

In another embodiment, a vector capable of expressing NTRAN or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of NTRAN including, but not limited to, those described above. 
5 In a further embodiment, a composition comprising a substantially purified NTRAN in 

conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of NTRAN including, but not limited to, 
those provided above. 

In still another embodiment, an agonist which modulates the activity of NTRAN may be 
10 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of NTRAN including, but not limited to, those listed above. 

In a further embodiment, an antagonist of NTRAN may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of NTRAN. Examples of such 
disorders include, but are not limited to, those autoimmune/inflammatory, cardiovascular, neurological, 
15 developmental, cell proliferative, transport, psychiatric, metabolic, and endocrine disorders described 
above. In one aspect, an antibody which specifically binds NTRAN may be used directly as an 
antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to 
. cells or tissues which express NTRAN. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
20 encoding NTRAN may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of NTRAN including, but not limited to, those described above. 

In other embodiments, any protein, agonist, antagonist, antibody, complementary sequence, or 
vector embodiments may be administered in combination with other appropriate therapeutic agents. 
Selection of the appropriate agents for use in combination therapy may be made by one of ordinary 
25 skill in the art, according to conventional pharmaceutical principles. The combination of therapeutic 
agents may act synergistically to effect the treatment or prevention of the various disorders described 
above. Using this approach, one may be able to achieve therapeutic efficacy with lower dosages of 
each agent, thus reducing the potential for adverse side effects. 

An antagonist of NTRAN may be produced using methods which are generally known in the 
30 art In particular, purified NTRAN may be used to produce antibodies or to screen libraries of 

pharmaceutical agents to identify those which specifically bind NTRAN. Antibodies to NTRAN may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
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not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. In an embodiment, neutralizing antibodies (i.e., those 
which inhibit dimer formation) can be used therapeutically. Single chain antibodies (e.g., from camels 
or llamas) may be potent enzyme inhibitors and may have application in the design of peptide mimetics, 
5 and in the development of immuno-adsorbents and biosensors (Muyldermans, S. (2001) J. Biotechnol. 
74:277-302). 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, camels, 
dromedaries, llamas, humans, and others may be immunized by injection with NTRAN or with any 
fragment or oligopeptide thereof which has immunogenic properties. Depending on the host species, 
10 various adjuvants may be used to increase immunological response. Such adjuvants include, but are 
not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among 
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are 
especially preferable. 

15 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 

NTRAN have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are substantially identical to a portion of the amino acid sequence of the natural protein. 
Short stretches of NTTRAN amino acids may be fused with those of another protein, such as KLH, 

20 and antibodies to the chimeric molecule may be produced. 

Monoclonal antibodies to NTRAN may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique (Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al (1985) J. Immunol. Methods 

25 81:31-42; Cote, R.J. et al. (1983) Proc. Natl Acad. Sci. USA 80:2026-2030; Cole, S.P. et al. (1984) 
MoL Cell BioL 62:109-120). 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used (Morrison, S.L. et al. (1984) Proc. Natl. Acad. 

30 Sci. USA 81:6851-6855; Neuberger, M.S. et aL (1984) Nature 312:604-608; Takeda, S. et al. (1985) 
Nature 314:452-454). Alternatively, techniques described for the production of single chain antibodies 
may be adapted, using methods known in the art, to produce NTRAN-specific single chain antibodies. 
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Antibodies with related specificity, but of distinct idiotypic composition, may be generated by chain 
shuffling from random combinatorial immunoglobulin libraries (Burton, D.R. (1991) Proc. Natl. Acad. 
Sci. USA 88:10134-10137). 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
5 or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature (Orlandi, R. et aL (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, G. et al. 
(1991) Nature 349:293-299). 

Antibody fragments which contain specific binding sites for NTRAN may also be generated. 
For example, such fragments include, but are not limited to, F(ab 7 ) 2 fragments produced by pepsin 
10 digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab*)2 fragments. Alternatively, Fab expression libraries maybe constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity (Huse, W.D. et al. (1989) 
Science 246:1275-1281). 

Various immunoassays may be used for screening to identify antibodies having the desired 
15 specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between NTRAN and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering NTRAN epitopes is generally used, but a competitive binding assay may also be 
20 employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for NTRAN. Affinity is expressed as an association 
constant, K,, which is defined as the molar concentration of NTRAN-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The K, 
25 determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for 
multiple NTRAN epitopes, represents the average affinity, or avidity, of the antibodies for NTRAN. 
The Ka determined for a preparation of monoclonal antibodies, which are monospecific for a particular 
NTRAN epitope, represents a true measure of affinity. High-affinity antibody preparations with K> 
ranging from about 10 9 to 10 12 L/mole are preferred for use in immunoassays in which the NTRAN- 
30 antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with 
ranging from about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar 
procedures which ultimately require dissociation of NTRAN, preferably in active form, from the 
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antibody (Catty, D. (1988) Antibodies, Volume I: A Practical Approach , IRL Press, Washington DC; 
Liddell, J.E. and A. Cryer (1991) A Practical Guide to Monoclonal Antibodies , John Wiley & Sons, 
New York NY). 

The titer and avidity of polyclonal antibody preparations maybe further evaluated to determine 
5 the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of NTRAN-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available (Catty, supra; Coligan et al, 
10 supra). 

In another embodiment of the invention, polynucleotides encoding NTRAN, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 
RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 

15 NTRAN. Such technology is well known in the art, and antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
NTRAN (Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press, Totawa NJ). 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 

20 intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein (Slater, J.E. et 
al. (1998) J. Allergy Clin. Immunol. 102:469-475; Scanlon, KJ. et al. (1995) 9:1288-1296). Antisense 
sequences can also be introduced intracellularly through the use of viral vectors, such as retrovirus and 
adeno-associated virus vectors (Miller, A.D. (1990) Blood 76:271; Ausubel et al., supra; Uckert, W. 

25 and W. Walther (1994) Pharmacol. Tber. 63 323-347). Other gene delivery mechanisms include 
liposome-derived systems, artificial viral envelopes, and other systems known in the art (Rossi, J.J. 
(1995) Br. Med. Bull. 51:217-225; Boado, RJ. et al. (1998) J. Fhaim. Sci. 87:1308-1315; Morris, 
M.C. et al. (1997) Nucleic Acids Res. 25:2730-2736). 

Ea another embodiment of the invention, polynucleotides encoding NTRAN may be used for 

30 somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SOD)-Xl disease characterized by X- 
linked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
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immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 

5 hypercholesterolemia, and hemophilia resulting from Factor VIQ or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, LM. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 

10 Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), hepatitis 
B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi). In 
the case where a genetic deficiency in NTRAN expression or regulation causes disease, the 
expression of NTRAN from an appropriate population of transduced cells may alleviate the clinical 

15 manifestations caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
NTRAN are treated by constructing mammalian expression vectors encoding NTRAN and 
introducing these vectors by mechanical means into NTRAN-deficient cells. Mechanical transfer 
technologies for use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual . 

20 cells, (ii) ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated 
gene transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. 
Rev. Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J.-L. and H. Recipon (1998) 
Curr. Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of NTRAN include, but are not 

25 limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 

(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), 
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). NTRAN 
maybe expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous 
sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or P-actin genes), (ii) an inducible promoter 

30 (e.g., the tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. 
USA 89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and HM. Blau 
(1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); 
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the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding NTRAN from a normal individual. 
5 Commercially available liposome transformation kits (e.g. , the PERFECT IIPED 

TRANSACTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, FX. and A. J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
10 (1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of die invention, diseases or disorders caused by genetic defects with 
respect to NTRAN expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding NTRAN under the control of an independent promoter or the retrovirus long 
15 terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus ris-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, L et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
20 an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol 61:1647-1650; Bender, M.A. et aL (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. et 
al. (1998) J. Virol 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining 
25 retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses 
a method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T-ceHs), and the 
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. ViroL 71:7020-7029; Bauer, G. et 
30 al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. ViroL 71:4707-4716; Ranga, U. et al. (1998) 
Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In an embodiment, an adenovirus-based gene therapy delivery system is used to deliver 
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polynucleotides encoding NTRAN to cells which have one or more genetic abnormalities with respect 
to the expression of NTRAN. The construction and packaging of adenovirus-based vectors are well 
known to those with ordinary skill in the art Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
5 (Csete, M.E. et aL (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy 7 '), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et aL (1999; Annu. 
Rev Nutx. 19:511-544) and Verma, LM. and N. Somia (1997; Nature 18:389:239-242). 

In another embodiment, a herpes-based, gene therapy delivery system is used to deliver 
10 polynucleotides encoding NTRAN to target cells which have one or more genetic abnormalities with 
respect to the expression of NTRAN. The use of herpes simplex virus (HSV)-based vectors maybe 
especially valuable for introducing NTRAN to cells of the central nervous system, for which HSV has 
a tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1-based vector has 
15 been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent No. 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 
which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
20 the control of the appropriate promoter for purposes including human gene therapy. Also taught by 
this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and 
ICP22. For HSV vectors, see also Goins, W.F. et al. (1999; J. Virol. 73:519-532) and Xu, H. et aL 
(1994; Dev. Biol. 163:152-161). The manipulation of cloned herpesvirus sequences, the generation of 
recombinant virus following the transfection of multiple plasmids containing different segments of the 
25 large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells with 
herpesvirus are techniques well known to those of ordinary skill in the art. 

In another embodiment, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding NTRAN to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
30 on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. BiotechnoL 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, 
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resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for NTRAN into the 
alphavirus genome in place of the capsid-coding region results in the production of a large number of 
NTRAN-coding RNAs and the synthesis of high levels of NTRAN in vector transduced cells. While 

5 alphavirus infection is typically associated with cell lysis within a few days, the ability to establish a 
persistent infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) 
indicates that the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy 
application (Dryga, S.A. et al (1997) Virology 228:74-83). The wide host range of alphaviruses will 
allow the introduction of NTRAN into a variety of cell types. The specific transduction of a subset of 

10 cells in a population may require the sorting of cells prior to transduction. The methods of 
manipulating infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA 
transfections, and performing alphavirus infections, are well known to those with ordinary skill in the 
art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
15 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 

be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 

inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 

transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have . . 

been described in the literature (Gee, J.E. et aL (1994) in Huber, B.E. and B.L Carr, Molecular and 
20 Immunologic Approaches , Futura Publishing, Mt Kisco NY, pp. 163-177). A complementary 

sequence or antisense molecule may also be designed to block translation of mRNA by preventing the 

transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
25 molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 

engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 

endonucleolytic cleavage of RNA molecules encoding NTRAN. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
30 GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 

corresponding to the region of the target gene containing the cleavage site, may be evaluated for 

secondary structural features which may render the oligonucleotide inoperable. The suitability of 
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candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes maybe prepared by any method 
known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically 

5 synthes izin g oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, 
RNA molecules may be generated by in vitro and in vivo transcription of DNA molecules encoding 
NTRAN. Such DNA sequences may be incorporated into a wide variety of vectors with suitable 
RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or 

10 tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 

15 extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

In other embodiments of the invention, the expression of one or more selected polynucleotides 
of the present invention can be altered, inhibited, decreased, or silenced using RNA interference 

20 (RNAi) or post-transcriptional gene silencing (PTGS) methods known in the art. RNAi is a post- 
transcriptional mode of gene silencing in which double-stranded RNA (dsRNA) introduced into a 
targeted cell specifically suppresses the expression of the homologous gene (i.e., the gene bearing the 
sequence complementary to the dsRNA). This effectively knocks out or substantially reduces the 
expression of the targeted gene. PTGS can also be accomplished by use of DNA or DNA fragments 

25 as well. RNAi methods are described by Fire, A. et al. (1998 ; Nature 39 1 :806-8 1 1) and Gura, T. 

(2000; Nature 404:804-808). PTGS can also be initiated by introduction of a complementary segment 
of DNA into the selected tissue using gene delivery and/or viral vector delivery methods described 
herein or known in the art. 

RNAi can be induced in mammalian cells by the use of small interfering RNA also known as 

30 siRNA. SiRNA are shorter segments of dsRNA (typically about 21 to 23 nucleotides in length) that 
result in vivo from cleavage of introduced dsRNA by the action of an endogenous ribonuclease. 
SiRNA appear to be the mediators of the RNAi effect in mammals. The most effective siRNAs 
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appear to be 21 nucleotide dsRNAs with 2 nucleotide 3* overhangs. The use of siRNA for inducing 
RNAi in mammalian cells is described by Elbashir, S.M. et aL (2001; Nature 411:494-498). 

SiRNA can either be generated indirectly by introduction of dsRNA into the targeted cell, or 
directly by mammalian transfection methods and agents described herein or known in the art (such as 

5 liposome-mediated transfection, viral vector methods, or other polynucleotide delivery/introductory 
methods). Suitable SiRNAs can be selected by examining a transcript of the target polynucleotide 
(e.g., mRNA) for nucleotide sequences downstream from the AUG start codon and recording the 
occurrence of each nucleotide and the 3* adjacent 19 to 23 nucleotides as potential siRNA target sites, 
with sequences having a 21 nucleotide length being preferred. Regions to be avoided for target 

10 siRNA sites include the 5' and 3' untranslated regions (UTRs) and regions near the start codon (within 
75 bases), as these may be richer in regulatory protein binding sites. TJTR-binding proteins and/or 
translation initiation complexes may interfere with binding of the siRNP endonuclease complex. The 
selected target sites for siRNA can then be compared to the appropriate genome database (e.g., 
human, etc.) using BLAST or other sequence comparison algorithms known in the art. Target 

15 sequences with significant homology to other coding sequences can be eliminated from consideration. 
The selected SiRNAs can be produced by chemical synthesis methods known in the art or by in vitro 
transcription using commercially available methods and kits such as the SILENCER siRNA 
construction kit (Ambion, Austin TX). 

In alternative embodiments, long-term gene silencing and/or RNAi effects can be induced in 

20 selected tissue using expression vectors that continuously express siRNA. This can be accomplished 
using expression vectors that are engineered to express hairpin RNAs (shRNAs) using methods 
known in the art (see, e.g., Brummelkamp, T.R. et al. (2002) Science 296:550-553; and Paddison, P.J. 
et al. (2002) Genes Dev. 16:948-958). In these and related embodiments, shRNAs can be delivered to 
target cells using expression vectors known in the art. An example of a suitable expression vector for 

25 delivery of siRNA is the PSILENCER1.0-U6 (circular) plasmid (Ambion). Once delivered to the 
target tissue, shRNAs are processed in vivo into siRNA-like molecules capable of carrying out gene- 
specific silencing. 

In various embodiments, the expression levels of genes targeted by RNAi or PTGS methods 
can be determined by assays for mRNA and/or protein analysis. Expression levels of the mRNA of a 
30 targeted gene, can be determined by northern analysis methods using, for example, the 

NORTHERNMAX-GLY kit (Ambion); by microairay methods; by PCR methods; by real time PCR 
methods; and by other RNA/polynucleotide assays known in the art or described herein. Expression 
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levels of the protein encoded by the targeted gene can be determined by Western analysis using 
standard techniques known in the art. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding NTRAN. 

5 Compounds which may be effective in altering expression of a specific polynucleotide may include, but 
are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 

10 polynucleotide expression. Thus, in the treatment of disorders associated with increased NTRAN 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding NTRAN may be therapeutically useful, and in the treatment of disorders associated with 
decreased NTRAN expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding NTRAN may be therapeutically useful. 

15 In various embodiments, one or more test compounds may be screened for effectiveness in 

altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in: 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary . 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 

20 based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding NTRAN is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding NTRAN are assayed 

25 by any method commonly known in the art. Typically, the expression of a specific nucleotide is 

detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding NTRAN. The amount of hybridization may be quantified, thus forming 
the basis for a comparison of the expression of the polynucleotide both with and without exposure to 
one or more test compounds. Detection of a change in the expression of a polynucleotide exposed to 

30 a test compound indicates that the test compound is effective in altering the expression of the 

polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomyces pombe gene expression system 
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(Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 
28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. 
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic 
5 acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide sequence 
(Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 
6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro > and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
10 taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art (Goldman, C.K. et al. (1997) Nat. Biotechnol. 15:462- 
466). 

Any of the therapeutic methods described above may be applied to any subject in need of 
15 such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 
generally comprises an active ingredient formulated with a pharmaceutically acceptable excipient. 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 

20 formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of 
NTRAN, antibodies to NTRAN, and mimetics, agonists, antagonists, or inhibitors of NTRAN. 

In various embodiments, the compositions described herein, such as pharmaceutical 
compositions, may be administered by any number of routes including, but not limited to, oral, 

25 intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, pulmonary, 
transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast- 

30 acting formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and 
proteins), recent developments in the field of pulmonary delivery via the alveolar region of the long 
have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. 
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et aL, U.S. Patent No. 5,997,848). Pulmonary delivery allows administration without needle injection, 

and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 

ingredients are contained in an effective amount to achieve the intended purpose. The determination 
5 of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 

macromolecules comprising NTRAN or fragments thereof. For example, liposome preparations 

containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 

macromolecule. Alternatively, NTRAN or a fragment thereof may be joined to a short cationic N- 
10 terminal portion from the HTV Tat-1 protein. Fusion proteins thus generated have been found to 

transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 

aL (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 

culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
15 or pigs. An animal model may also be used to determine the appropriate concentration range and 

route of administration. Such information can then be used to determine useful doses and routes for 

administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 

NTRAN or fragments thereof, antibodies of NTRAN, and agonists, antagonists or inhibitors of 
20 NTRAN, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 

determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such 

as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 (the 

dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 

therapeutic index, which can be expressed as the LD 50 /ED 50 ratio. Compositions which exhibit large 
25 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 

used to formulate a range of dosage for human use. The dosage contained in such compositions is 

preferably within a range of circulating concentrations that includes the ED 50 with little or no toxicity. 

The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 

patient, and the route of administration. 
30 The exact dosage will be determined by the practitioner, in light of factors related to the 

subject requiring treatment Dosage and administration are adjusted to provide sufficient levels of the 

active moiety or to maintain the desired effect Factors which may be taken into account include the 



86 



WO 03/051902 



PCT7US02/40059 



severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and response 
to therapy. Long-acting compositions maybe administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

5 Normal dosage amounts may vary from about 0. 1 y,% to 100,000 /^g, up to a total dose of 

about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 

10 conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind NTRAN may be used for the 
diagnosis of disorders characterized by expression of NTRAN, or in assays to monitor patients being 
treated with NTRAN or agonists, antagonists, or inhibitors of NTRAN. Antibodies useful for 

15 diagnostic purposes may be prepared in the same manner as described above for therapeutics. 
Diagnostic assays for NTRAN include methods which utilize the antibody and a label to detect 
NTRAN in human body fluids or in extracts of cells or tissues. The antibodies may be used with or 
without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known in 

20 the art and may be used. 

A variety of protocols for measuring NTRAN, including ELISAs, RIAs, and FACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of NTRAN expression. 
Normal or standard values for NTRAN expression are established by combining body fluids or cell 
extracts taken from normal mammalian subjects, for example, human subjects, with antibodies to 

25 NTRAN under conditions suitable for complex formation. The amount of standard complex formation 
maybe quantitated by various methods, such as photometric means. Quantities of NTRAN expressed 
in subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, polynucleotides encoding NTRAN may be used for 

30 diagnostic purposes. The polynucleotides which may be used include oligonucleotides, complementary 
RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and quantify gene 
expression in biopsied tissues in which expression of NTRAN may be correlated with disease. The 
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diagnostic assay maybe used to determine absence, presence, and excess expression of NTRAN, and. 
to monitor regulation of NTRAN levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotides, 
including genomic sequences, encoding NTRAN or closely related molecules maybe used to identity 
5 nucleic acid sequences which encode NTRAN. The specificity of the probe, whether it is made from 
a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a conserved 
motif, and the stringency of the hybridization or amplification will determine whether the probe 
identifies only naturally occurring sequences encoding NTRAN, allelic variants, or related sequences. 
Probes may also be used for the detection of related sequences, and may have at least 50% 
10 sequence identity to any of the NTRAN encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:23-44 or from 
genomic sequences including promoters, enhancers, and introns of the NTRAN gene. 

Means for producing specific hybridization probes for polynucleotides encoding NTRAN 
include the cloning of polynucleotides encoding NTRAN or NTRAN derivatives into vectors for the 
15 production of roRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety.of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, . 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
20 Polynucleotides encoding NTRAN may be used for the diagnosis of disorders associated with 

expression of NTRAN. Examples of such disorders include, but are not limited to, an 
autoimmune/inflammatory disorder such as acquired immunodeficiency syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
25 polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, episodic 
lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic gastritis, 
glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's thyroiditis, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, myocardial or 
30 pericardial inflammation, syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic 
sclerosis, thromb osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, 
rheumatoid arthritis, scleroderma, Sjogren's ocytopenic purpura, ulcerative colitis, uveitis, Werner 
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syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
fungal, parasitic, protozoal, and helminthic infections, and trauma; a cardiovascular disorder such as 
congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, hypertensive 
heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, congenitally bicuspid 
5 aortic valve, mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart 
disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation, arteriovenous fistula, 
atherosclerosis, hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose 
10 veins, thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, 
balloon angioplasty, vascular replacement, and coronary artery bypass graft surgery; a neurological 
disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Pick's 
disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, 
15 retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial 
and viral meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases including, 
kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial 
insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, tuberous 
20 sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorders of the central nervous system including Down syndrome, cerebral 
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
25 gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 
seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; a developmental 
disorder such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastic dwarfism, 
30 Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome (Wilms' 
tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, 
myelodysplasia syndrome, hereditary mucoepithelial dysplasia, hereditary keratodermas, hereditary 
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neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, 
hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, spina bifida, 
anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss; a cell 
proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
5 hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
10 penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus and a cancer such as 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, 
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; a transport disorder such as 
15 akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular 

dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, diabetic 
neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic periodic 
paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis, 
myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral 
20 neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 
tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline myopathy, 
centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, ethanol 
myopathy, dermatomyositis, inclusion body myositis, infectious myositis, polymyositis, neurological 
disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, dementia, 
25 depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other disorders 
associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal neuropathy, 
sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery stenosis, 
sensorineural autosomal deafiaess, hyperglycemia, hypoglycemia, Grave's disease, goiter, Cushing's 
disease, Addison's disease, glucose-galactose malabsorption syndrome, hypercholesterolemia, 
30 adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn syndrome, von Gierke 
disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a psychiatric disorder such 
as acute stress disorder, alcohol dependence, amphetamine dependence, anorexia nervosa, antisocial 
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personality disorder, attention-deficit hyperactivity disorder, autistic disorder, anxiety, avoidant 
personality disorder, bipolar disorder, borderline personality disorder, brief psychotic disorder, bulimia 
nervosa, cannabis dependence, cocaine dependence, conduct disorder, cyclothymic disorder, delirium, 
delusional disorder, dementia, dependent personality disorder, depression, dysthymic disorder, 

5 hallucinogen dependence, histrionic personality disorder, inhalant dependence, manic depression, 
multi-infarct dementia, narcissistic personality disorder, nicotine dependence, obsessive-compulsive 
disorder, opioid dependence, oppositional defiant disorder, panic disorder, paranoid personality disorder, 
phencyclidine dependence, phobia, posttraumatic stress disorder, schizoaffective disorder, schizoid 
personality disorder, schizophrenia, sedative dependence, separation anxiety disorder, and sleep 

10 disorder; a metabolic disorder such as Addisbn's disease, cerebrotendinous xanthomatosis, congenital 
adrenal hyperplasia, coumarin resistance, cystic fibrosis, fatty hepatocirrhosis, 

fructose-l,6-diphosphatase deficiency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary fructose intolerance, hyperadrenalism, hypoadrenalism, hyperparathyroidism, 
hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 

15 hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases, 

mannosidosis, neuraminidase deficiency, obesity, osteoporosis, phenylketonuria, pseudovitamin D- 
deficiency rickets, disorders of carbohydrate metabolism such as congenital type II dyserythropoietic 
anemia, diabetes, insulin-dependent diabetes mellitus, non-insulin-dependent diabetes mellitus, 
galactose epimerase deficiency, glycogen storage diseases, lysosomal storage diseases, fructosuria, 

20 pentosuria, and inherited abnormalities of pyruvate metabolism, disorders of lipid metabolism such as 
fatty liver, cholestasis, primary biliary cirrhosis, carnitine deficiency, carnitine palmitoyltransferase 
deficiency, myoadenylate deaminase deficiency, hypertriglyceridemia, lipid storage disorders such 
Fabry's disease, Gaucher' s disease, Niemann-Pick's disease, metachromatic leukodystrophy, 
adrenoleukodystrophy, GM 2 gangliosidosis, and ceroid lipofuscinosis, abetalipoproteinemia, Tangier 

25 disease, hyperlipoproteinemia, lipodystrophy, lipomatoses, acute panniculitis, disseminated fat necrosis, 
adiposis dolorosa, lipoid adrenal hyperplasia, minimal change disease, lipomas, atherosclerosis, 
hypercholesterolemia, hypercholesterolemia with hypertriglyceridemia, primary 
hypoalphalipoproteinemia, hypothyroidism, renal disease, liver disease, lecithin:cholesterol 
acyltransferase deficiency, cerebrotendinous xanthomatosis, sitosterolemia, hypocholesterolemia, Tay- 

30 Sachs disease, SandhofPs disease, hyperlipidemia, hyperlipemia, and lipid myopathies, and disorders of 
copper metabolism such as Menke's disease, Wilson's disease, and Ehlers-Danlos syndrome type IX 
diabetes; and an endocrine disorder such as a disorder of the hypothalamus and/or pituitary resulting 
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from lesions such as a primary brain tumor, adenoma, infarction associated with pregnancy, 
hypophysectomy, aneurysm, vascular malformation, thrombosis, infection, immunological disorder, and 
complication due to head trauma, a disorder associated with hypopituitarism including hypogonadism, 
Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-SchuUer-Cbristian disease, Letterer- 
5 Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism, a disorder associated with 

hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic hormone 
(ADH) secretion (SIADH) often caused by benign adenoma, a disorder associated with 
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), and 

10 cretinism, a disorder associated with hyperthyroidism including thyrotoxicosis and its various forms, 
Grave's disease, pretibial myxedema, toxic multinodular goiter, thyroid carcinoma, and Phimmer's 
disease, a disorder associated with hyperparathyroidism including Conn disease (chronic 
hypercalemia), a pancreatic disorder such as Type I or Type II diabetes mellitus and associated 
complications, a disorder associated with the adrenals such as hyperplasia, carcinoma, or adenoma of 

15 the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, Cushing's 
disease, Liddle's syndrome, and Arnold-Healy-Gordon syndrome, pheochromocytoma tumors, and 
Addison's disease, a disorder associated with gonadal steroid hormones such as: in women, abnormal 
prolactin production, infertility, endometriosis, perturbation of the menstrual cycle, polycystic ovarian 
disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, 

20 hermaphroditism, hirsutism and virilization, breast cancer, and, in post-menopausal women, 

osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell aplasia, a 
hypergpnadal disorder associated with Leydig cell tumors, androgen resistance associated with 
absence of androgen receptors, syndrome of 5 a-reductase, and gynecomastia. Polynucleotides 
encoding NTRAN may be used in Southern or northern analysis, dot blot, or other membrane-based 

25 technologies; in PCR technologies; in dipstick, pin, and muMformat ELISA-like assays; and in 
microarrays utilizing fluids or tissues from patients to detect altered NTRAN expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular embodiment, polynucleotides encoding NTRAN may be used in assays that 
detect the presence of associated disorders, particularly those mentioned above. Polynucleotides 

30 complementary to sequences encoding NTRAN may be labeled by standard methods and added to a 
fluid or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantified and 
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compared with a standard value. If the amount of signal in the patient sample is significantly altered in 
comparison to a control sample then the presence of altered levels of polynucleotides encoding 
NTRAN in the sample indicates the presence of the associated disorder. Such assays may also be 
used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical 

5 trials, or to monitor the treatment of an individual patient 

In order to provide a "basis for the diagnosis of a disorder associated with expression of 
NTRAN, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding NTRAN, under conditions suitable for hybridization or 

10 amplification. Standard hybridization may be quantified by comparing the values obtained from normal 
subjects with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with values 
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard 
values is used to establish the presence of a disorder. 

15 Once the presence of a disorder is established and a treatment protocol is initiated, 

hybridization assays maybe repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

20 With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier, thereby preventing the development or further 

25 progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 
NTRAN may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide 
encoding NTRAN, or a fragment of a polynucleotide complementary to the polynucleotide encoding 

30 NTRAN, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 



93 



WO 03/051902 



PCT/US02/40059 



In a particular aspect, oligonucleotide primers derived from polynucleotides encoding NTRAN 
may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and 
deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of 
SNP detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and 

5 fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from polynucleotides 
encoding NTRAN are used to amplify DNA using the polymerase chain reaction (PCR). The DNA 
maybe derived, for example, from diseased or normal tissue, biopsy samples, bodily fluids, and the 
like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR products in 
single-stranded form, and these differences are detectable using gel electrophoresis in non-denaturing 

10 gels. In fSCCP, the oligonucleotide primers are fluorescently labeled, which allows detection of the 
amplimers in high-throughput equipment such as DNA sequencing machines. Additionally, sequence 
database analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by 
comparing the sequence of individual overlapping DNA fragments which assemble into a common 
consensus sequence. These computer-based methods filter out sequence variations due to laboratory 

15 preparation of DNA and sequencing errors using statistical models and automated analyses of DNA 
sequence chromatograms. In the alternative, SNPs may be detected and characterized by mass 
spectrometry using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San 
Diego CA). 

SNPs may be used to study the genetic basis of human disease. For example, at least 16 
20 common SNPs have been associated with non-insulin-dependent diabetes mellitus. SNPs are also 
useful for examining differences in disease outcomes in monogenic disorders, such as cystic fibrosis, 
siclde cell anemia, or chronic granulomatous disease. For example, variants in the mannose-binding 
lectin, MBL2, have been shown to be correlated with deleterious pulmonary outcomes in cystic 
fibrosis. SNPs also have utility in pharmacogenomics, the identification of genetic variants that 
25 influence a patient's response to a drug, such as life-threatening toxicity. For example, a variation in 
N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the 
anti-tuberculosis drug isoniazid, while a variation in the core promoter of the ALOX5 gene results in 
diminished clinical response to treatment with an anti-asthma drug that targets the 5-lipoxygenase 
pathway. Analysis of the distribution of SNPs in different populations is useful for investigating 
30 genetic drift, mutation, recombination, and selection, as well as for tracing the origins of populations 
and their migrations (Taylor, J.G. et al. (2001) Trends MoL Med. 7:507-512; Kwok, P.-Y. and Z. Gu 
(1999) Mol. Med. Today 5:538-543; Nowotny, P. et al. (2001) Curr. Opin. Neurobiol. 11:637-641). 
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Methods which may also be used to quantify the expression of NTRAN include radiolabeling 
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves (Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et al. (1993) 
Anal. Biochem. 212:229-236). The speed of quantitation of multiple samples may be accelerated by 
5 running the assay in a high-throughput format where the oligomer or polynucleotide of interest is 
presented in various dilutions and a spectrophotometry or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotides described herein may be used as elements on a microarray. The microarray can be 

10 used in transcript imaging techniques which monitor the relative expression levels of large numbers of 
genes simultaneously as described below. The microarray may also be used to identify genetic 
variants, mutations, and polymorphisms. This information may be used to determine gene function, to 
understand the genetic basis of a disorder, to diagnose a disorder, to monitor progression/regression of 
disease as a function of gene expression, and to develop and monitor the activities of therapeutic 

15 agents in the treatment of disease. In particular, this information may be used to develop a 

pharmacogenomic profile of a patient in order to select the most appropriate and effective treatment 
regimen for that patient. For example, therapeutic agents which are highly effective and display the . 
fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, NTRAN, fragments of NTRAN, or antibodies specific for NTRAN 

20 may be used as elements on a microarray. The microarray may be used to monitor or measure 

protein-protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 

25 quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time (Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent No. 5,840,484; 
hereby expressly incorporated by reference herein). Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 

30 hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 
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Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the 
case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which, profile the expression of the polynucleotides of the present invention 

5 may also be used in conjunction with in vitro model systems and preclinical evaluation of 

pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and NX. Anderson (2000) 

10 Toxicol Lett. 112-1 13 :467-47 1). If a test compound has a signature similar to that of a compound 
with known toxicity, it is likely to share those toxic properties. These fingerprints or signatures are 
most useful and refined when they contain expression information from a large number of genes and 
gene families. Ideally, a genome- wide measurement of expression provides the highest quality 
signature. Even genes whose expression is not altered by any tested compounds are important as 

15 well, as the levels of expression of these genes are used to normalize the rest of the expression data. 
The normalization procedure is useful for comparison of expression data after treatment with different 
compounds. While the assignment of gene function to elements of a toxicant signature aids in 
interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical 
matching of signatures which leads to prediction of toxicity (see, for example, Press Release 00-02 

20 from the National Institute of Environmental Health Sciences, released February 29, 2000, available at 
http://www.niehs.nih.gov/oc/news/toxchip.htm). Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

In an embodiment, the toxicity of a test compound can be assessed by treating a biological 
sample containing nucleic acids with the test compound. Nucleic acids that are expressed in the 

25 treated biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present invention 
may be quantified. The transcript levels in the treated biological sample are compared with levels in 
an untreated biological sample. Differences in the transcript levels between the two samples are 
indicative of a toxic response caused by the test compound in the treated sample. 

30 Another embodiment relates to the use of the polypeptides disclosed herein to analyze the 

proteome of a tissue or cell type. The term proteome refers to the global pattern of protein expression 
in a particular tissue or cell type. Each protein component of a proteome can be subjected individually 
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to further analysis. Proteome expression patterns, or profiles, are analyzed by quantifying the number 
of expressed proteins and their relative abundance under given conditions and at a given time. A 
profile of a cell's proteome may thus be generated by separating and analyzing the polypeptides of a 
particular tissue or cell type. In one embodiment, the separation is achieved using two-dimensional gel 

5 electrophoresis, in which proteins from a sample are separated by isoelectric focusing in the first 

dimension, and then according to molecular weight by sodium dodecyl sulfate slab gel electrophoresis 
in the second dimension (Steiner and Anderson, supra). The proteins are visualized in the gel as 
discrete and uniquely positioned spots, typically by staining the gel with an agent such as Coomassie 
Blue or silver or fluorescent stains. The optical density of each protein spot is generally proportional to 

10 the level of the protein in the sample. The optical densities of equivalently positioned protein spots 
from different samples, for example, from biological samples either treated or untreated with a test 
compound or therapeutic agent, are compared to identify any changes in protein spot density related to 
the treatment. The proteins in the spots are partially sequenced using, for example, standard methods 
employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of the protein 

15 in a spot may be determined by comparing its partial sequence, preferably of at least 5 contiguous 
amino acid residues, to the polypeptide sequences of interest. In some cases, further sequence data 
may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for NTRAN to quantify „ 
the levels of NTRAN expression. In one embodiment, the antibodies are used as elements on a 

20 microarray, and protein expression levels are quantified by exposing the microarray to the sample and 
detecting the levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 
270:103-111; Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by 
a variety of methods known in the art, for example, by reacting the proteins in the sample with a thiol- 
or amino-reactive fluorescent compound and detecting the amount of fluorescence bound at each 

25 array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript leveL There is a poor 
correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures maybe 
30 useful in the analysis of compounds which do not significantly affect the transcript image, but which 
alter the proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 
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In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 
biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to the amount of the corresponding protein in an untreated biological sample. 
5 A difference in the amount of protein between the two samples is indicative of a toxic response to the 
test compound in the treated sample. Individual proteins are identified by sequencing the amino acid 
residues of the individual proteins and comparing these partial sequences to the polypeptides of the 
present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 

10 sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared 
with the amount in an untreated biological sample. A difference in the amount of protein between the 
two samples is indicative of a toxic response to the test compound in the treated sample. 

15 Microarrays may be prepared, used, and analyzed using methods known in the art (Brennan, 

T.M. et al (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl Acad. Sci. USA 
93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251116; Shalon, D. et al. (1995) 
PCT application WO95/35505; Heller, RA. et aL (1997) Proc. Natl. Acad. Sci.. USA 94:2150-2155; 
Heller, MJ. et al. (1997) U.S. Patent No. 5,605,662). Various types of microarrays are well known 

20 and thoroughly described in Schena, M., ed. (1999; DNA Microarrays: A Practical Approach , Oxford 
University Press, London). 

In another embodiment of the invention, nucleic acid sequences encoding NTRAN may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
Either coding or noncoding sequences maybe used, and in some instances, noncoding sequences may 

25 be preferable over coding sequences. For example, conservation of a coding sequence among 
members of a multi-gene family may potentially cause undesired cross hybridization during 
chromosomal mapping. The sequences may be mapped to a particular chromosome, to a specific 
region of a chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (Y ACs), bacterial artificial chromosomes (BACs), bacterial PI 

30 constructions, or single chromosome cDNA libraries (Harrington, J. J. et al. (1997) Nat Genet 15:345- 
355; Price, CM. (1993) Blood Rev. 7:127-134; Trask, B.J. (1991) Trends Genet 7:149-154). Once 
mapped, the nucleic acid sequences maybe used to develop genetic linkage maps, for example, which 
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correlate the inheritance of a disease state with the inheritance of a particular chromosome region or 
restriction fragment length polymorphism (RFLP) (Lander, E.S. and D. Botstein (1986) Proc. Natl. 
Acad. Sci. USA 83:7353-7357). 

Fluorescent in situ hybridization (FISH) maybe correlated with other physical and genetic 
5 map data (Heinz-Ulrich, et aL (1995) in Meyers, supra, pp. 965-968). Examples of genetic map data 
can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding NTRAN on a physical 
map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus may further positional cloning efforts. 
10 In situ hybridization of chromosomal preparations and physical mapping techniques, such as 

linkage analysis using established chromosomal markers, maybe used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, 
may reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
15 techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 

localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any 
sequences mapping to that area may represent associated or regulatory genes for further investigation 
(Gatti, R.A. et aL (1988) Nature 336:577-580). The nucleotide sequence of the instant invention may 4 
also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 
20 among normal, carrier, or affected individuals. 

In another embodiment of the invention, NTRAN, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intraceHularly. The formation of binding complexes 
25 between NTRAN and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (Geysen, et aL (1984) PCT application 
WO84/03564). In this method, large numbers of different small test compounds are synthesized on a 
solid substrate. The test compounds are reacted with NTRAN, or fragments thereof, and washed. 
30 Bound NTRAN is then detected by methods well known in the art. Purified NTRAN can also be 
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 
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In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding NTRAN specifically compete with a test compound for binding 
NTRAN. In this manner, antibodies can be used to detect the presence of any peptide which shares 
one or more antigenic determinants with NTRAN. 

5 In additional embodiments, the nucleotide sequences which encode NTRAN may be used in 

any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 

10 description, utilize the present invention to its fullest extent. The following embodiments are, therefore, 
to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way 
whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 
including U.S. Ser. No. 60/340,798, U.S. Ser. No. 60/365,645, U.S. Ser. No. 60/367,662, U.S. Ser. 
15 No. 60/379,887, and U.S. Ser. No. 60/384,639, are hereby expressly incorporated by reference. 

EXAMPLES 

L Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the LDFESEQ GOLD 
20 database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in guanidinium 
isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of 
denaturants, such as TRIZOL (Thvitrogen), a monophasic solution of phenol and guanidine 
isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted with 
chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
25 ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OUGOTEX latex particles 
(QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
30 RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g. , the 
POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
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libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNEAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the . 
recommended procedures or similar methods known in the art (Ausubel et al., supra, ch. 5). Reverse 
transcription was initiated using oligo d(T) or random primers. Synthetic oligonucleotide adapters were 

5 ligated to double stranded cDNA, and the cDNA was digested with the appropriate restriction enzyme 
or enzymes. For most libraries, the cDNA was size-selected (300-1000 bp) using SEPHACRYL 
S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham 
Biosciences) or preparative agarose gel electrophoresis. cDNAs were ligated into compatible 
restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid 

10 (Stratagene), PSPORT1 plasmid (Invitrogen, Carlsbad CA), PCDNA2. 1 plasmid (Invitrogen), PBK- 
CMV plasmid (Stratagene), PCR2-TOPOTA plasmid (Invitrogen), PCMV-ICIS plasmid (Stratagene), 
pIGEN (Incyte Genomics, Palo Alto CA), pRARE (Incyte Genomics), or pINCY (Incyte Genomics), 
or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells including 
XLl-Bhie, XLl-BlueMRF, or SOLR from Stratagene or DH5a, DH10B, or ElectroMAX DH10B 

15 from Invitrogen. 

II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 

20 AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 
96 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0. 1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 

25 high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
3 84- well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a ELUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 

30 III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation such 
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as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Biosciences or supplied in ABI sequencing kits such as the 
5 ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEGABACE 1000 DNA sequencing system (Amersham Biosciences); 
the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard 
ABI protocols and base calling software; or other sequence analysis systems known in the art. 
10 Reading frames within the cDNA sequences were identified using standard methods (Ausubel et al., 
supra, ch. 7). Some of the cDNA sequences were selected for extension using the techniques 
disclosed in Example Vm 

The polynucleotide sequences derived from Incyte cDNAs were validated by removing 
vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 
15 programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as the GenBahk primate, rodent, mammalian, vertebrate, and eukaryote databases, and. 
BLOCKS, PRINTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo 
sapiens, Rattus norvegicus, Mus musculus, Caenorhabditis elegans, Saccharomyces cerevisiae, 
20 Schizosaccharomyces pombe, and Candida albicans (Incyte Genomics, Palo Alto CA); hidden 

Markov model (HMM)-based protein family databases such as PFAM, INCY, and TIGRFAM (Haft, 
D.H. et al. (2001) Nucleic Acids Res. 29:41-43); and HMM-based protein domain databases such as 
SMART (Schultz, J. et aL (1998) Proc. Natl. Acad. Sci. USA 95:5857-5864; Letunic, I. et al. (2002) 
Nucleic Acids Res. 30:242-244). (HMM is a probabilistic approach which analyzes consensus 
25 primary structures of gene families; see, for example, Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 
6:361-365.) The queries were performed using programs based on BLAST, FASTA, BLIMPS, and 
HMMER. The Incyte cDNA sequences were assembled to produce full length polynucleotide 
sequences. Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched 
sequences, or Genscan-predicted coding sequences (see Examples IV and V) were used to extend 
30 Incyte cDNA assemblages to full length. Assembly was performed using programs based on Phred, 
Phrap, and Consed, and cDNA assemblages were screened for open reading frames using programs 
based on GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated 
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to derive the corresponding full length polypeptide sequences. Alternatively, a polypeptide may begin 
at any of the methionine residues of the full length translated polypeptide. Full length polypeptide 
sequences were subsequently analyzed by querying against databases such as the GenBank protein 
databases (genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, 

5 PRODOM, Prosite, hidden Markov model (HMM)-based protein family databases such as PFAM, 
1NCY, and TIGRFAM; and HMM-based protein domain databases such as SMART. Full length 
polynucleotide sequences are also analyzed using MACDNASIS PRO software (MiraiBio, Alameda 
CA) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments 
are generated using default parameters specified by the CLUSTAL algorithm as incorporated into the 

10 MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent 
identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 

15 column provides brief descriptions thereof, the third column presents appropriate references, all of 
which are incorporated by reference herein in their entirety, and the fourth column presents, where 
applicable, the scores, probability values, and other parameters used to evaluate the strength of a 
match between two sequences (the higher the score or the lower the probability value, the greater the . 
identity between two sequences). 

20 The programs described above for the assembly and analysis of full length polynucleotide and 

polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:23-44. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies are described in Table 4, column 2. 
IV. Identification and Editing of Coding Sequences from Genomic DNA 

25 Putative neurotransmission-associated proteins were initially identified by running the Genscan 

gene identification program against public genomic sequence databases (e.g., gbpri and gbhtg). 
Genscan is a general-purpose gene identification program which analyzes genomic DNA sequences 
from a variety of organisms (Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94; Burge, C. and S. 
Karlin (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to 

30 form an assembled cDNA sequence extending from a methionine to a stop codon. The output of 
Genscan is a FASTA database of polynucleotide and polypeptide sequences. The maximum range of 
sequence for Genscan to analyze at once was set to 30 kb. To determine which of these Genscan 



103 



WO 03/051902 



PCT/US02/40059 



predicted cDNA sequences encode neurotransmission-associated proteins, the encoded polypeptides 
were analyzed by querying against PFAM models for neurotransmission-associated proteins. 
Potential neurotransmission-associated proteins were also identified by homology to Incyte cDNA 
sequences that bad been annotated as neurotransmission-associated proteins. These selected 
5 Genscan-predicted sequences were then compared by BLAST analysis to the genpept and gbpri 
public databases. Where necessary, the Genscan-predicted sequences were then edited by 
comparison to the top BLAST hit from genpept to correct errors in the sequence predicted by 
Genscan, such as extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or 
public cDNA coverage of the Genscan-predicted sequences, thus providing evidence for transcription. 
10 When Incyte cDNA coverage was available, this information was used to correct or confirm the 
Genscan predicted sequence. Full length polynucleotide sequences were obtained by assembling 
Genscan-predicted coding sequences with Incyte cDNA sequences and/or public cDNA sequences 
using the assembly process described in Example JR. Alternatively, full length polynucleotide 
sequences were derived entirely from edited or unedited Genscan-predicted coding sequences. 
15 V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example , 
DI were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
20 exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 
based on graph theory and dynamic programming to integrate cDNA and genomic information, 
generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
full length sequence. Sequence intervals in which the entire length of the interval was present on 
more than one sequence in the cluster were identified, and intervals thus identified were considered to 
25 be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent. This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
thus identified were then "stitched" together by the stitching algorithm in the order that they appear 
along their parent sequences to generate the longest possible sequence, as well as sequence variants. 
30 Linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were given preference over linkages which change parent 
type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
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by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to the top BLAST hit from genpept. Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA, when necessary. 
"Stretched" Sequences 

5 Partial DNA sequences were extended to full length with an algorithm based on BLAST 

analysis. First, partial cDNAs assembled as described in Example HI were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 
analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 

10 Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 
(HSPs) to map the translated sequences onto the GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for homologous 
genomic sequences from the public human genome databases. Partial DNA sequences were 

15 therefore "stretched" or extended by the addition of homologous genomic sequences. The resultant 
stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of NTRAN Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:23-44 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 

20 implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:23-44 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Gen6thon were used to determine if any of the clustered sequences 

25 had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
ann. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 

30 chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 

humans, although this can vary widely due to hot and cold spots of recombination) The cM distances 
are based on genetic markers mapped by Gen6thon which provide boundaries for radiation hybrid 
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markers whose sequences were included in each of the clusters. Human genome maps and other 
resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(http://www.ncbi.nlrn.nih.gov/genemap/), can be employed to determine if previously identified disease 
genes map within or in proximity to the intervals indicated above. 

5 Association of a disease with a chromosomal locus can be determined by lod score. Lod 

score is a statistical method used to test the linkage of two or more loci within families having a 
genetic disease. The lod score is the logarithm to base 10 of the odds in favor of linkage. Linkage is 
defined as the tendency of two genes located on the same chromosome to be inherited together 
through meiosis (Genetics in Medicine, Fifth Edition, (1991) Thompson, M.W. et al., W.B. Saunders 

10 Co. Philadelphia). A lod score of +3 or greater (1000:1 odds in favor of linkage) indicates a probability 
of 1 in 1000 that a particular marker was found solely by chance in affected individuals, which is 
strong evidence that two genetic loci are linked. 

One such gene implicated in PD is PARK3, which maps to 2pl3 (Gasser, T. et al. (1998) 
Nature Genet 18:262-265). A marker at chromosomal position D2S441 was found to have a lod 

15 score of 3.2 in the region of PARK3. This marker supported the disease association of PARK3 in 
the chromosomal interval from D2S134 to D2S286 (Gasser et al., supra) . Markers located within 
chromosomal intervals D2S134 and D2S286, which map between 83.88 to 94.05 centiMorgans on the 
short arm of chromosome 2, were used to identify genes that map in the region between D2S134 and 
D2S286. 

20 A second PD gene, implicated in early-onset recessive parkinsonism, is PARK6, located on 

chromosome 1 at Ip35-lp36. Several markers were obtained with lod scores greater than 3 including 
D1S199, D1S2732, D1S2828, D1S478, D1S2702, D1S2734, D1S2674 (Valente, E.M. et al, supra). 
These markers were used to determine the PD-relevant range of chromosome loci and identify 
sequences that map to chromosome 1 between D1S199 and D1S2885. 

25 Restriction fragment length polymorphism (RFLP) markers shown to be near regions of DNA 

known as sequence-tagged sites (STS), have been mapped to NT_Contigs generated by the Human 
Genome Project using ePCR (Schuler, G.D. (1997) Genome Research 7: 541-550, and (1998) Trends 
BiotechnoL 16(ll):456-9). Contigs containing regions of DNA with known disease-associated 
markers are therefore used to identify NTRAN sequences that map to disease-associated regions of 

30 the genome. 

Polynucleotides encoding NTRAN were mapped to NT_Contigs. Contigs longer than 1Mb 
were broken into subcontigs of 1Mb length with overlaping sections of lOOkb. A preliminary step used 
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an algorithm, similar to MEGABLAST, to define the mRNA sequence /masked genomic DNA contig 

pairings. The cDNA/genomic pairings identified by the first algorithm were confirmed, and the 

NTRAN polynucleotides mapped to DNA contigs, using SIM4 (Florea, L. et al. (1998) Genome Res. 

8:967-74, version May 2000) which had been optimized for high throughput processing and strand 
5 assignment confidence). The SIM4 output of the mRNA sequence/genomic contig pairs was further 

processed to determine the correct location of the NTRAN polynucleotides on the genomic contig, as 

well as their strand identity. 

SEQ ID NO:40 was mapped to NT_Contig GBI:NTJ)05428J)01.7 from Genbank release 

February 2002, covering a 9.65 Mb region of the genome that also contains PD-associated genetic 
10 markers D2S134 and D2S286. The maximum distance between SEQ ID NO:40 and markers D2S134 

and D2S286, therefore, is 9.65 Mb. Thus, SEQ ID NO:40 is in proximity with genetic markers shown 

to consistently associate with PD. 

VII, Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
15 gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 

from a particular cell type or tissue have been bound (Sambrook and Russell, supra, ch. 7; Ausubel et 

al, supra, ch. 4). 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is much 

20 faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer search 
can be modified to determine whether any particular match is categorized as exact or similar. The 
basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

25 The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 

30 (HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 

gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 
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BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
5 identity and 100% overlap. 

Alternatively, polynucleotides encoding NTRAN are analyzed with respect to the tissue 
sources from which they were derived. For example, some full length sequences are assembled, at 
least in part, with overlapping Incyte cDNA sequences (see Example LI). Each cDNA sequence is 
derived from a cDNA library constructed from a human tissue. Each human tissue is classified into 
10 one of the following organ/tissue categories: cardiovascular system; connective tissue; digestive 
system; embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; 
germ cells; hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; 
respiratory system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract. 
The number of libraries in each category is counted and divided by the total number of libraries across 
15 all categories. Similarly, each human tissue is classified into one of the following disease/condition 

categories: cancer, cell line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, 
and other, and the number of libraries in each category is. counted and divided by the total number of 
libraries across all categories. The resulting percentages reflect the tissue- and disease-specific 
expression of cDNA encoding NTRAN. cDNA sequences and cDNA library/tissue information are 
20 found in the LIFESEQ GOLD database (Incyte Genomics, Palo Alto CA). 
VIII. Extension of NTRAN Encoding Polynucleotides 

Full length polynucleotides are produced by extension of an appropriate fragment of the full 
length molecule using oligonucleotide primers designed from this fragment. One primer was 
synthesized to initiate 5' extension of the known fragment, and the other primer was synthesized to 
25 initiate 3 ' extension of the known fragment. The initial primers were designed using OLIGO 4.06 
software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 
30 Selected human cDNA libraries were used to extend the sequence. If more than one 

extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR using methods well known in the art PCR 
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was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , (NH^SO^ 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Biosciences), ELONGASE enzyme 
(Invitrogen), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair 

5 PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair 17 and SK+ were as follows: Step 1: 94 °C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57 °C, 1 min; Step 4: 68 °C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

10 The concentration of DNA in each well was determined by dispensing 100 fil PICOGREEN 

quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 jxl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fhioroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 

15 concentration of DNA. A 5 (A to 10 }A aliquot of the reaction mixture was analyzed by 

electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the . 
sequence. 

The extended nucleotides were desalted and concentrated, transferred to 3 84- well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 

20 sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun 
sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, 
fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Biosciences), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, and 

25 transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 

media, and individual colonies were picked and cultured overnight at 37 °C in 384-weH plates in LB/2x 
carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Biosciences) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 
30 1: 94 °C, 3 min; Step 2: 94 °C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 
4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by 
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries 
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were reamplified using the same conditions as described above. Samples were diluted with 20% 
dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers 
and the DYENAMIC DIRECT kit (Amersham Biosciences) or the ABI PRISM BIGDYE 
Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
5 In like manner, full length polynucleotides are verified using the above procedure or are used 

to obtain 5' regulatory sequences using the above procedure along with oligonucleotides designed for 
such extension, and an appropriate genomic library. 

EX. Identification of Single Nucleotide Polymorphisms in NTRAN Encoding 
Polynucleotides 

10 Common DNA sequence variants known as single nucleotide polymorphisms (SNPs) were 

identified in SEQ ID NO:23-44 using the LIFESEQ database (Incyte Genomics). Sequences from the 
same gene were clustered together and assembled as described in Example DI, allowing the 
identification of all sequence variants in the gene. An algorithm consisting of a series of filters was 
used to distinguish SNPs from other sequence variants. Preliminary filters removed the majority of 

15 basecall errors by requiring a minimum Phred quality score of 15, and removed sequence alignment 
errors and errors resulting from improper trimming of vector sequences, chimeras, and splice variants. 
An automated procedure of advanced chromosome analysis analysed the original chromatogram files 
in the vicinity of the putative SNP. Clone error filters used statistically generated algorithms to identify 
errors introduced during laboratory processing, such as those caused by reverse transcriptase, 

20 polymerase, or somatic mutation. Clustering error filters used statistically generated algorithms to 

identify errors resulting from clustering of close homologs or pseudogenes, or due to contamination by 
non-human sequences. A final set of filters removed duplicates and SNPs found in immunoglobulins 
or T-cell receptors. 

Certain SNPs were selected for further characterization by mass spectrometry using the high 
25 throughput MASSAKRAY system (Sequenom, Inc.) to analyze allele frequencies at the SNP sites in 
four different human populations. The Caucasian population comprised 92 individuals (46 male, 46 
female), including 83 from Utah, four French, three Venezualan, and two Amish individuals. The 
African population comprised 194 individuals (97 male, 97 female), all African Americans. The 
Hispanic population comprised 324 individuals (162 male, 162 female), all Mexican Hispanic. The 
30 Asian population comprised 126 individuals (64 male, 62 female) with a reported parental breakdown 
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. Allele 
frequencies were first analyzed in the Caucasian population; in some cases those SNPs which showed 
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no allelic variance in this population were not further tested in the other three populations. 
X. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:23-44 are employed to screen cDNAs, 
genomic DNAs, or niRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 

5 pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /^Ci of 
[y_32p] adenosine triphosphate (Amersham Biosciences), and T4 polynucleotide kinase (DuPont NEN, 
Boston MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 

10 superfine size exclusion dextranbead column (Amersham Biosciences). An aliquot containing 10 7 
counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of 
human genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco EI, Pst I, 
Xba I, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

15 membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

20 XI. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler et al., supra), 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 
aforementioned technologies should be uniform and solid with a non-porous surface (Schena, M., ed. 

25 (1999) DNA Microarrays: A Practical Approach , Oxford University Press, London). Suggested 

substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
produced using available methods and machines well known to those of ordinary skill in the art and 

30 may contain any appropriate number of elements (Schena, M. et aL (1995) Science 270:467-470; 

Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 
16:27-31). 
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Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 

5 biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 

10 the microarray may be assessed. In one embodiment, microarray preparation and usage is described 
in detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 

15 reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/jil oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/fxl RNase inhibitor, 500 fiM dATP, 500 pM dGTP, 500 /xM dTTP, 40 fM 
dCTP, 40 jliM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Biosciences). The reverse transcription 
reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with GEMBRIGHT kits 
(Incyte Genomics). Specific control poly^A) + RNAs are synthesized by in vitro transcription from 

20 non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one with 
Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for 
20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified using two 
successive CHROMA SPIN 30 gel filtration spin columns (Clontech, Palo Alto CA) and after 
combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml 

25 sodium acetate, and 300 ml of 100% ethanol. The sample is then dried to completion using a 

SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 fil 5X SSC/0.2% SDS. 
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element 
is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 

30 primers complementary to the vector sequences flanking the cDNA insert. Array elements are 

amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 fig. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Biosciences). 
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Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
5 coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 110°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in U.S. 
Patent No. 5,807,522, incorporated herein by reference. 1 jxl of the array element DNA, at an average 
concentration of 100 ng//il, is loaded into the open capillary printing element by a high-speed robotic 

10 apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 0.2% 

15 SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 fil of sample mixture consisting of 0.2 fig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with 

20 an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 
fil of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 
6.5 hours at 60° C The arrays are washed for 10 min at 45° C in a first wash buffer (IX SSC, 0.1% 
SDS), three times for 10 minutes each at 45° C in a second wash buffer (0.1X SSC), and dried. 

25 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 mn for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
30 containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 
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In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
5 emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is . 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
10 the array contains a complementary DNA sequence, allowing the intensity of the signal at that location 
to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples from 
different sources (e.g., representing test and control cells), each labeled with a different fluorophore, 
are hybridized to a single array for the purpose of identifying genes that are differentially expressed, 
the calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and 
1 5 adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RT1-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a , 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
20 signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
25 to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte Genomics). Array 
elements that exhibit at least about a two-fold change in expression, a signal-to-background ratio of at 
least about 2.5, and an element spot size of at least about 40%, are considered to be differentially 
expressed. 
30 Expression 

For example, SEQ ID NO:24 showed differential expression in lung squamous carcinoma 
tissues versus normal lung tissues as determined by microarray analysis. The expression of SEQ ID 
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NO:24 was decreased by at least two fold in lung squamous carcinoma tissues relative to grossly 
uninvolved normal lung tissue from the same donors. Therefore, SEQ ID NO:24 is useful as a 
diagnostic marker or as a potential therapeutic target for lung cancer. 

SEQ ID NO:24 also showed differential expression in ovarian adenocarcinoma tissues versus 
5 normal ovarian tissues as determined by microarray analysis. The expression of SEQ ID NO:24 was 
decreased by at least two fold in ovarian adenocarcinoma tissues relative to grossly uninvolved normal 
ovarian tissue from the same donor. Therefore, SEQ ID NO:24 is useful as a diagnostic marker or as 
a potential therapeutic target for ovarian cancer. 

Furthermore, SEQ ID NO:24 showed region-specific gene expression in the human brain as 
10 determined by microarray analysis. The expression of SEQ ID NO:24 was decreased by at least two 
fold in the cervical spinal cord relative to pooled brain tissues which were constituted from the major 
regions of the brain from two male brains; a 47 year old and a 48 year old. The tissue from the 
cervical spinal cord was isolated from a 47 year old male, the same 47 year old donor as in the pooled 
sample. Therefore, SEQ ID NO:24 serves as a useful biomarker for human brains, specifically for the 
15 cervical spinal cord. 

In an alternative example, the expression of SEQ ID NO:25, as determined by microarray 
. analysis, was increased by at least two fold in sigmoid colon tissues relative to normal sigmoid colon 
tissues. The sigmoid colon tumor tissue which originated from a metastatic gastric sarcoma (stromal 
tumor) was harvested from a 48 year old female donor. The normal sigmoid colon tissue was 
20 harvested from grossly uninvolved sigmoid colon tissue of the same donor. Therefore, SEQ ID NO:25 
is useful as a diagnostic marker or as a potential therapeutic target for colon cancer. 

SEQ ID NO:25 also showed decreased expression in tissue affected by lung adenocarcinoma 
versus normal lung tissue as determined by microarray analysis. A sample of right lung tissue that 
showed moderately differentiated adenocarcinoma was compared to grossly uninvolved lung tissue 
25 from the same donor (Huntsman Cancer Institute, Salt Lake City, UT). Therefore, SEQ ID NO:25 is 
useful in diagnostic assays for and monitoring treatment of lung cancer. 

Furthermore, the expression of SEQ ID NO:25, as determined by microarray analysis, was 
increased by at least two fold in Tangier disease-derived fibroblasts relative to normal fibroblasts. 
Both types of cells were cultured in the presence of cholesterol and compared with the same cell type 
30 in the absence of cholesterol. The human fibroblasts were obtained from skin transplants from both 
normal subjects and two patients with homozygous Tangier disease. Cell lines were immortalized by 
transfection with human papillomavirus 16 genes E6 and E7 and a neomycin resistance selectable 
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marker. TD derived cells are deficient in an assay of apoA-I mediated tritiated cholesterol efflux. 
Therefore, SEQ ID NO:25 is useful in diagnostic assays for and monitoring treatment of Tangier 
disease. 

In an alternative example, SEQ ID NO:30 showed decreased expression in lung tissue 

5 affected by squamous cell adenocarcinoma versus normal lung tissue as determined by microarray 
analysis. Grossly uninvolved hing tissue with no visible abnormalities, from a 73 year-old male, was 
compared to lung squamous cell adenocarcinoma tissue from the same donor (Roy Castle 
International Centre for Lung Cancer Research, Liverpool, UK). Therefore, SEQ ID NO:30 is useful 
in monitoring treatment of, and diagnostic assays for, lung cancer and other cell proliferative disorders. 

10 As another example, SEQ ID NO:30 showed decreased expression in tissue affected by 

ovarian tumor versus normal ovary tissue as determined by microarray analysis. A normal ovary from 
a 79 year-old female donor was compared to an ovarian tumor from the same donor (Huntsman 
Cancer Institute, Salt Lake City, UT). Therefore, SEQ ID NO:30 is useful in monitoring treatment of, 
and diagnostic assays for, ovarian cancer and other cell proliferative disorders. 

15 In an alternative example, SEQ ID NO:36 showed decreased expression in colon 

adenocarcinoma tissue versus grossly uninvolved colon tissue. Gene expression profiles were 
obtained by comparing normal colon tissue to colon tumor tissue from the same donor. Therefore, 
SEQ ID NO:36 is useful in monitoring treatment of, and diagnostic assays for, colon cancer and other 
cell proliferative disorders. 

20 In an alternative example, SEQ ID NO:38 showed differential expression, as determined by 

microarray analysis. Array elements that exhibited about at least a two-fold change in expression and 
a signal intensity over 250 units, a signal-to-background ratio of a least 2.5, and an element spot size of 
at least 40% were identified as differentially expressed using the GEMTOOLS program (Incyte 
Genomics). 

25 For example, the expression of cDNA from grossly uninvolved lung tissue from a 66 year-old 

male was compared to lung squamous cell carcinoma tissue from the same donor (Roy Castle 
International Centre for Lung Cancer Research, Liverpool, UK). The expression of SEQ ID NO:38 
showed at least a two-fold increase in expression in the lung tumor tissue when compared to normal 
lung tissue expression levels from the same donor. Thus, SEQ ID NO:38 is useful for monitoring 

30 progression of, and diagnostic assays for, lung cancers. 

In another example, specific dissected brain regions from a normal 61-year-old female were 
compared to dissected regions from the brain of a 79-year-old-female with severe Alzheimer's 
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disease (AD). The diagnosis of normal or severe AD was established by a certified neuropathologist 
based on microscopic examination of multiple sections throughout the brain. The expression of SEQ 
ID NO:38 showed at least a two-fold decrease in expression in the amygdala and anterior 
hippocampus brain regions when compared to the same brain regions from the normal 61-year-old 
5 female. Therefore, SEQ ID NO:38 is also useful for monitoring progression of, and diagnostic assays 
for, Alzheimer's disease. 

In another example, SEQ ID NO:41 showed tissue-specific expression. RNA samples 
isolated from a variety of normal human tissues were compared to a common reference sample. 
Tissues contributing to the reference sample were selected for their ability to provide a complete 
10 distribution of RNA in the human body and include brain (4%), heart (7%), kidney (3 %), lung (8%), 
placenta (46%), small intestine (9%), spleen (3%), stomach (6%), testis (9%), and uterus (5%). The 
normal tissues assayed were obtained from at least three different donors. RNA from each donor 
was separately isolated and individually hybridized to the microarray. Since these hybridization 
experiments were conducted using a common reference sample, differential expression values are 
15 directly comparable from one tissue to another. The expression of SEQ ID NO:41 was increased by 
at least two-fold in brain (amygdaloid body, occipital cortex, and parietal cortex) tissues as compared 
to the reference sample. Therefore, SEQ ID NO:41 can be used as a marker for brain tissues. 
XII- Complementary Polynucleotides 

Sequences complementary to the NTRAN-encoding sequences, or any parts thereof, are used 
20 to detect, decrease, or inhibit expression of naturally occurring NTRAN. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of NTRAN. 
To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5* sequence 
25 and used to prevent promoter binding to the coding sequence. To inhibit translation, a complementary 
oligonucleotide is designed to prevent ribosomal binding to the NTRAN-encoding transcript. 
XHL Expression of NTRAN 

Expression and purification of NTRAN is achieved using bacterial or virus-based expression 
systems. For expression of NTRAN in bacteria, cDNA is subcloned into an appropriate vector 
30 containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tad) hybrid 
promoter and the T5 or 17 bacteriophage promoter in conjunction with the lac operator regulatory 
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element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express NTRAN upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of NTRAN in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographica calif ornica nuclear polyhedrosis virus 

5 (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding NTRAN by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodopterafrugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 

10 Infection of the latter requires additional genetic modifications to baculovirus (Engelhard, E.K. et al. 
(1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937- 
1945). 

In most expression systems, NTRAN is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 

15 affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma japonicum, enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Biosciences). 
Following purification, the GST moiety can be proteolytically cleaved from NTRAN at specifically 
engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification using 

20 commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a 
stretch of six consecutive histidine residues, enables purification on metal-chelate resins (QIAGEN). 
; Methods for protein expression and purification are discussed in Ausubel et al. {supra, ch. 10 and 16). 
Purified NTRAN obtained by these methods can be used directly in the assays shown in Examples 
XVII and XVHI, where applicable. 

25 XIV. Functional Assays 

NTRAN function is assessed by expressing the sequences encoding NTRAN at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA 
expression. Vectors of choice include PCMV SPORT plasmid (Invitrogen, Carlsbad CA) and 

30 PCR3.1 plasmid (Invitrogen), both of which contain the cytomegalovirus promoter. 5-10 \x% of 
recombinant vector are transiently transfected into a human cell line, for example, an endothelial or 
hematopoietic cell line, using either liposome formulations or electroporation. 1-2 \i% of an additional 
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plasmid containing sequences encoding a marker protein are co-transfected. Expression of a marker 
protein provides a means to distinguish transfected cells from nontransfected cells and is a reliable 
predictor of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., 
Green Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry 
5 (FCM), an automated, laser optics-based technique, is used to identify transfected cells expressing 
GFP or CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM 
detects and quantifies the uptake of fluorescent molecules that diagnose events preceding or 
coincident with cell death. These events include changes in nuclear DNA content as measured by 
staining of DNA with propidium iodide; changes in cell size and granularity as measured by forward 

10 light scatter and 90 degree side light scatter; down-regulation of DNA synthesis as measured by 
decrease inbromodeoxyuridine uptake; alterations in expression of cell surface and intracellular 
proteins as measured by reactivity with specific antibodies; and alterations in plasma membrane 
composition as measured by the binding of fluorescein-conjugated Annexin V protein to the cell 
surface. Methods in flow cytometry are discussed in Ormerod, M.G. (1994; Flow Cytometry . Oxford, 

15 New York NY). 

The influence of NTRAN on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding NTRAN and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions ., 
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected 

20 cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in the 
art. Expression of mRNA encoding NTRAN and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 
XV. Production of NTRAN Specific Antibodies 

25 NTRAN substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g. , 

Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize animals (e.g., rabbits, mice, etc.) and to produce antibodies using standard protocols. 

Alternatively, the NTRAN amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 

30 synthesized and used to raise antibodies by means known to those of skill in the art Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art (Ausubel et al., supra, ch. 11). 
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Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysucciniroide ester (MBS) to 
increase innnunogenicity (Ausubel et al., supra). Rabbits are immunized with the oligopeptide-KLH 
5 complex in complete Freund's adjuvant Resulting antisera are tested for antipeptide and anti-NTRAN 
activity by, for example, binding the peptide or NTRAN to a substrate, blocking with 1% BSA, 
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XVI. Purification of Naturally Occurring NTRAN Using Specific Antibodies 
Naturally occurring or recombinant NTRAN is substantially purified by immunoaffinity 

10 chromatography using antibodies specific for NTRAN. An immunoaffinity column is constructed by 
covalently coupling anti-NTRAN antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Biosciences). After the coupling, the resin is blocked and 
washed according to the manufacturer's instructions. 

Media containing NTRAN are passed over the immunoaffinity column, and the column is 

15 washed under conditions that allow the preferential absorbance of NTRAN (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/NTRAN binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and NTRAN is collected. 

XVII. Identification of Molecules Which Interact with NTRAN 

20 NTRAN, or biologically active fragments thereof, are labeled with Bolton-Hunter reagent 

(Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539). Candidate molecules previously 
arrayed in the wells of a multi-well plate are incubated with the labeled NTRAN, washed, and any 
wells with labeled NTRAN complex are assayed. Data obtained using different concentrations of 
NTRAN are used to calculate values for the number, affinity, and association of NTRAN with the 

25 candidate molecules. 

Alternatively, molecules interacting with NTRAN are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989; Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 
NTRAN may also be used in the PATHCALUNG process (CuraGen Corp., New Haven 

30 CT) which employs the yeast two-hybrid system in a high-throughput manner to determine all 
interactions between the proteins encoded by two large libraries of genes (Nandabalan, K. et aL 
(2000) U.S. Patent No. 6,057,101). 
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XVin. Demonstration of NTRAN Activity 

An assay for NTRAN activity measures the expression of NTRAN on the cell surface. 
cDNA encoding NTRAN is transfected into an appropriate mammalian cell line. Cell surface proteins 
are labeled withbiotin as described (de la Fuente, M.A. et al. (1997) Blood 90:2398-2405). 
5 Immunoprecipitations are performed using NTRAN-specific antibodies, and immunoprecipitated 
samples are analyzed using sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) 
and immunoblotting techniques. The ratio of labeled immunoprecipitant to unlabeled 
immunoprecipitant is proportional to the amount of NTRAN expressed on the cell surface. 
In the alternative, an assay for NTRAN activity is based on a prototypical assay for 
10 ligand/receptor-mediated modulation of cell proliferation. This assay measures the rate of DNA 

synthesis in Swiss mouse 3T3 cells. A plasmid containing polynucleotides encoding NTRAN is added 
to quiescent 3T3 cultured cells using transfection methods well known in the art. The transiently 
transfected cells are then incubated in the presence of [ 3 H]thymidine, a radioactive DNA precursor 
molecule. Varying amounts of NTRAN ligand are then added to the cultured cells. Incorporation of 
15 [ 3 H]thymidine into acid-precipitable DNA is measured over an appropriate time interval using a 
radioisotope counter, and the amount incorporated is directly proportional to the amount of newly 
synthesized DNA. A linear dose-response curve over at least a hundred-fold NTRAN ligand 
concentration range is indicative of receptor activity. One unit of activity per milliliter is defined as the. 
concentration of NTRAN producing a 50% response level, where 100% represents maximal 
20 incorporation of pH]thymidine into acid-precipitable DNA (McKay, I. and I. Leigh, eds. (1993) 
Growth Factors: A Practical Approach , Oxford University Press, New York NY, p. 73.) 

In a further alternative, the assay for NTRAN activity is based upon the ability of GPCR 
family proteins to modulate G protein-activated second messenger signal transduction pathways (e.g., 
cAMP; Gaudin, P. et al. (1998) J. Biol. Chem. 273:4990-4996). A plasmid encoding full length 
25 NTRAN is transfected into a mammalian cell line (e.g. , Chinese hamster ovary (CHO) or human 
embryonic kidney (HEK-293) cell lines) using methods well-known in the art. Transfected cells are 
grown in 12-well trays in culture medium for 48 hours, then the culture medium is discarded, and the 
attached cells are gently washed with PBS. The cells are then incubated in culture medium with or 
without ligand for 30 minutes, then the medium is removed and cells lysed by treatment with 1 M 
30 perchloric acid. The cAMP levels in the lysate are measured by radioimmunoassay using methods 
well-known in the art. Changes in the levels of cAMP in the lysate from cells exposed to ligand 
compared to those without ligand are proportional to the amount of NTRAN present in the transfected 
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cells. 

To measure changes in inositol phosphate levels, the cells are grown in 24-well plates 
containing 1x10 s cells/well and incubated with inositol-free media and [^myoinositol, 2 mCi/well, for 
48 hr. The culture medium is removed, and the cells washed with buffer containing 10 mM LiCl 
5 followed by addition of ligand. The reaction is stopped by addition of perchloric acid. Inositol 

phosphates are extracted and separated on Dowex AG1-X8 (Bio-Rad) anion exchange resin, and the 
total labeled inositol phosphates counted by liquid scintillation. Changes in the levels of labeled inositol 
phosphate from cells exposed to ligand compared to those without ligand are proportional to the 
amount of NTRAN present in the transfected cells. 

10 In a further alternative, the ion conductance capacity of NTRAN is demonstrated using an 

electrophysiological assay. NTRAN is expressed by transforming a mammalian cell line such as 
COS7, HeLa or CHO with a eukaryotic expression vector encoding NTRAN. Eukaryotic expression 
vectors are commercially available, and the techniques to introduce them into cells are well known to 
those skilled in the art. A small amount of a second plasmid, which expresses any one of a number of 

15 marker genes such as (3-galactosidase, is co-transformed into the cells in order to allow rapid 
identification of those cells which have taken up and expressed the foreign DNA. The cells are 
incubated for 48-72 hours after transformation under conditions appropriate for the cell line to allow 
expression and accumulation of NTRAN and P-galactosidase. Transformed cells expressing p- 
galactosidase are stiained blue when a suitable colorimetric substrate is added to the culture media 

20 under conditions that are well known in the art. Stained cells are tested for differences in membrane 
conductance due to various ions by electrophysiological techniques that are well known in the art. 
Untransformed cells, and/or cells transformed with either vector sequences alone or P-galactosidase 
sequences alone, are used as controls and tested in parallel. The contribution of NTRAN to cation or 
anion conductance can be shown by incubating the cells using antibodies specific for either NTRAN. 

25 The respective antibodies will bind to the extracellular side of NTRAN, thereby blocking the pore in 
the ion channel, and the associated conductance. To study the dependence of NAP on external ions 
sodium can be replaced by choline or N-methyl-D-gtacamine and chloride by gluconate, N0 3 , or S0 4 
(Kavanaugh, M.P. et al. (1992) J. Biol. Chem. 267:22007-22009). 

In a further alternative, NTRAN transport activity is assayed by measuring uptake of labeled 

30 substrates into Xenopus laevis oocytes. Oocytes at stages V and VI are injected with NTRAN 

mRNA (10 ng per oocyte) and incubated for 3 days at 18 °C in OR2 medium (82.5 mM NaCl, 2.5 mM 
KC1, 1 mM CaC^, 1 mM MgC^, 1 mM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50 jug/ml 
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gentamycin, pH 7.8) to allow expression of NTRAN protein. Oocytes are then transferred to 
standard uptake medium (100 mM NaCl, 2 mM KC1, 1 mM CaCL>, 1 mM MgC^, 10 mM Hepes/Tris 
pH 7.5). Uptake of various substrates (e.g., amino acids, sugars, drugs, and neurotransmitters) is 
initiated by adding a 3 H substrate to the oocytes. After incubating for 30 minutes, uptake is terminated 
5 by washing the oocytes three times in NaMree medium, measuring the incorporated 3 H, and 

comparing with controls. NTRAN activity is proportional to the level of internalized 3 H substrate. 

In a further alternative, NTRAN activity can be demonstrated using an electrophysiological 
assay for ion conductance. Capped NTRAN mRNA transcribed with T7 polymerase is injected into 
defolliculated stage V Xenopus oocytes, similar to the previously described method. Two to seven 
10 days later, transport is measured by two-electrode voltage clamp recording. Two-electrode voltage 
clamp recordings are performed at a holding potential of 50 mV. The data are filtered at 10 Hz and 
recorded with the MacLab digital-to-analog converter and software for data acquisition and analysis 
(AD Instruments, Castle Hill, Australia). To study the dependence of NTRAN on external ions, 
sodium can be replaced by choline or N-methyl-D-ghicamine and chloride by gluconate, N0 3 , or S0 4 
15 (Kavanaugh, M.P. et al. (1992) J. Biol. Chem. 267:22007-22009). 

Calmodulin-binding activity of NTRAN can be demonstrated by mixing 1.5 ml of 0.2 jiM 
calmodulin with increasing concentrations of NTRAN (30 \i\ in concentrations ranging from 0.5 to 3 . 1 
jxM). Analysis of binding activity is determined by Scatchard representation. The bound NTRAN is 
calculated as AN/AN^^ x [total calmodulin]. The free NTRAN concentration is calculated as [total 
20 NTRAN] - [bound NTRAN]. (Bosc, C et al. (2001) J. BioL Chem. 276:30904-30913). 

Alternatively, calmodulin-binding activity of NTRAN can be determined in a binding assay 
using calmodulin Sepharose beads. Calmodulin Sepharose 6B beads (1 mg/ml calmodulin) are 
equilibrated in 20 mM Hepes-KOH, pH 7.2, 0. 15 M NaCl, 5 mg/ml bovine serum albumin, 0.02% 
Triton X-100 (v/v) (buffer C) and either 2.5 mM CaCl^ or 5 mM EGTA. Samples of NTRAN are 
25 incubated with 5 fil of calmodulin-Sepharose 6B beads for 3 hours at 22 °C in 0.5 ml of buffer C in the 
presence or absence (5 mM EGTA) of 2.5 mM CaClj. Subsequently, beads are pelleted by 
centrifugation at 1000 x g and washed five times (1.5 ml each wash) with modified buffer C (Triton 
X-100 concentration is increased to 0.1% (v/v) and albumin is omitted), by resuspension and 
centrifugation at 1000 x g. Washed beads containing bound ligand are incubated in 0. 1 ml of SDS gel 
30 loading buffer for 5 minutes at 95 °C. Bound NTRAN is visualized by Western immunoblotting. 
(Rossi, E.A. et al. (1999) J. Biol. Chem. 274:27201-27210). 

Sialic acid-binding activity of NTRAN can be demonstrated in a COS cell binding assay. 
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COS cells are transfected with plasmids encoding full-length NTRAN using DEAE-dextran as 
described in Simmons D.L. (1993) Cloning cell surface molecules by transient expression in 
mammalian cells. I n Cellular Interactions in Development — A Practical Approach . Edited by Hartley 
D.A.. Oxford: ERL Press, 93-128. On the third day after transfection, binding assays with 

5 erythrocytes are carried out for 1 hour at 37 °C in DMEM culture medium with 0.2 % BSA. Unbound 
erythrocytes are washed off and the cells are fixed in 0.25 % glutardialdehyde. To assess the effect 
of antibodies on binding, COS cells are pre-incubated with immunoglobulins at 20 fig/ml for 1 hour 
prior to the addition of erythrocytes (Kelm, S. et al (1994) Curr. Biol. 4:965-972). 

In the alternative, choline transporter activity or choline-transporter-like CTL1 protein activity 

10 of NTRAN is determined by measuring choline uptake by yeast transformed with expression vectors 
harboring polynucleotides encoding NTRAN. The assay is performed in nitrogen-free medium at 
30°C for 10 or 30 min in the presence of 25 nM [ 3 H]choline. The cells are then filtered, and washed. 
The amount of [ 3 H]choline present in the cells is proportional to the activity of NTRAN in the cells 
(O'Regan, S. supra). 

15 In a further alternative, NTRAN protein kinase (PK) activity is measured by phosphorylation 

of a protein substrate using gamma-labeled p^l-ATP and quantitation of the incorporated 
radioactivity using a gamma radioisotope counter. NTRAN is incubated with the protein substrate, 
[ 32 P]~ATP, and an appropriate kinase buffer. The 32 P incorporated into the product is separated from, 
free pPl-ATP by electrophoresis and the incorporated 32 P is counted. The amount of 32 P recovered 

20 is proportional to the PK activity of NTRAN in the assay. A determination of the specific amino acid 
residue phosphorylated is made by phosphoamino acid analysis of the hydrolyzed protein. 

An assay for NTRAN activity measures the expression of NTRAN on the cell surface. 
cDNA encoding NTRAN is transfected into a non-leukocytic cell line. Cell surface proteins are 
labeled withbiotin (de la Fuente, M.A. et al. (1997) Blood 90:2398-2405). Immunoprecipitations are 

25 performed using NTRAN-specific antibodies, and immunoprecipitated samples are analyzed using 
SDS-PAGE and immunoblotting techniques. The ratio of labeled immunoprecipitant to unlabeled 
immunoprecipitant is proportional to the amount of NTRAN expressed on the cell surface. 

Alternatively, an assay for NTRAN activity measures the amount of cell aggregation induced 
by overexpression of NTRAN. In this assay, cultured cells such as NDH3T3 are transfected with 

30 cDNA encoding NTRAN contained within a suitable mammalian expression vector under control of a 
strong promoter. Cotransfection with cDNA encoding a fluorescent marker protein, such as Green 
Fluorescent Protein (CLONTECH), is useful for identifying stable transfectants. The amount of cell 
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agglutination, or clumping, associated with transfected cells is compared with that associated with 
untransfected cells. The amount of cell agglutination is a direct measure of NTRAN activity. 

Alternatively, an assay for NTRAN activity measures the disruption of cytoskeletal filament 
networks upon overexpression of NTRAN in cultured cell lines (Rezniczek, G. A. et al (1998) J. Cell 

5 Biol. 141:209-225). cDNA encoding NTRAN is subcloned into a mammalian expression vector that 
drives high levels of cDNA expression. This construct is transfected into cultured cells, such as rat 
kangaroo PtK2 or rat bladder carcinoma 804G cells. Actin filaments and intermediate filaments such 
as keratin and vimentin are visualized by immunofluorescence microscopy using antibodies and 
techniques well known in the art. The configuration and abundance of cyoskeletal filaments can be 

10 assessed and quantified using confocal imaging techniques. In particular, the bundling and collapse of 
cytoskeletal filament networks is indicative of NTRAN activity. 

Alternatively, cell adhesion activity in NTRAN is measured in a 9 6- well plate in which wells 
are first coated with NTRAN by adding solutions of NTRAN of varying concentrations to the wells. 
Excess NTRAN is washed off with saline, and the wells incubated with a solution of 1% bovine 

15 serum albumin to block non-specific cell binding. Aliquots of a cell suspension of a suitable cell type 
are then added to the wells and incubated for a period of time at 37 °C. Non-adherent cells are 
washed off with saline and the cells stained with a suitable cell stain such as Coomassie blue. The 
intensity of s tainin g is measured using a variable wavelength multi-well plate reader and compared to a 
standard curve to determine the number of cells adhering to the NTRAN coated plates. The degree 

20 of cell staining is proportional to the cell adhesion activity of NTRAN in the sample. 

Various modifications and variations of the described compositions, methods, and systems of 
the invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. It will be appreciated that the invention provides novel and useful proteins, and their 

25 encoding polynucleotides, which can be used in the drug discovery process, as well as methods for 
using these compositions for the detection, diagnosis, and treatment of diseases and conditions. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Nor should the description of such embodiments be considered exhaustive or limit the invention to the 

30 precise forms disclosed. Furthermore, elements from one embodiment can be readily recombined with 
elements from one or more other embodiments. Such combinations can form a number of 
embodiments within the scope of the invention. It is intended that the scope of the invention be 
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Signature Sequences, Domains and Motifs 


Cell attachment sequence: R256-D258 


signal_cleavage: Ml -A22 


|Signal Peptide: M4-A22, M1-A22 


Leucine Rich Repeat: N261-L284, N213-E236, N93-P1 16, D310-P334, 
N237-V260, Nl 89-1212, H285-P309, N141-H164, N165-P188, Q117- 
S140 


! Leucine rich repeat N-terminal domain: D28-P68 | 
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PROTEIN BAC CLONE RGl 18D07 FROM 7Q31 COMPLETE 
SEQUENCE GLIOMA AMPLIFIED PD032016: M283-L349 


PROTEINBAC CLONE RGl 18D07 FROM7Q31 COMPLETE 
SEQUENCE NEURONAL LEUCINERICH PD080235: P188-E236 


BAC CLONE RG118D07 FROM 7Q31 COMPLETE SEQUENCE 
GLIOMA AMPLIFIED ON PD080342: N237-N282 


Potential Phosphorylation Sites: S51 S173 S242 T42 T95 


Potential Glycosylation Sites: N93 N103 N223 
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Signature Sequences, Domains and Motifs 


LGN motif, putative GEFs specific for G-alpha: M63-V85, Q23-L45 


[signaLcleavage: M38-S96 


PURKINJE CELL PROTEIN 2 PCD5 CELL SPECIFIC L7 
NEURONE PD067101: M38-P136 


[Potential Phosphorylation Sites: S55 S65 S127 T52 T114 


(signaLcleavage: M1-G34 


(Signal Peptide: M 1 -G34 


(Potential Phosphorylation Sites: S33 S48 S153 S178 | 


(Potential Glycosylation Sites: N64 


[signaLcleavage: M1-G35 


Signal Peptide: P 1 8-G35 


| Signal Peptide: L16-G35 \ 


1 Signal Peptide: P13-G35 I 


Signal Peptide: R11-G35 


Signal Peptide: M1-G35 


Signal Peptide: A14-G35 


Signal Peptide: P13-G37 | 


Fibronectin type 3 domain: P424-S507, A521-G603 I 


Immunoglobulin: P336-L418, P244-Q327, P145-S228, P48-A136 [ 


Immunoglobulin C-2 Type: P342-G407, V54-G124, E151-S216, T250- 
T314 


Fibronectin type HI domain: P424-S510, A521-T606 


Immunoglobulin domain: G344-A402, H252-A309, G56-A119, G153- 
A211 


I type Ig domains from SCOP: H335-G425, K236-F332, E41-D140 I 


Ig superfamily from SCOP: F332-L423, F141-G231, 1240-A330, F44- 
1133 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting 
5 ofSEQIDNO:l-22, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-2, SEQ ED NO:5-6, SEQ ID NO:10, SEQ ID NO:15, and SEQ ID NO:17-19, 

c) a polypeptide comprising a naturally occurring amino acid sequence at least 95% 
10 identical to the amino acid sequence of SEQ ID NO:3, 

d) a polypeptide comprising a naturally occurring amino acid sequence at least 96% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:9,andSEQIDNO:14, 

e) a polypeptide comprising a naturally occurring amino acid sequence at least 97% 
15 identical to the amino acid sequence of SEQ ID NO:7, 

f) a polypeptide comprising a naturally occurring amino acid sequence at least 99% 
identical to an amino acid sequence selected from the group consisting of of SEQ ID 
NO:13, and SEQ ID NO:20, 

g) a polypeptide consisting essentially of a naturally occurring amino acid sequence at 
20 least 90% identical to an amino acid sequence selected from the group consisting of 

of SEQ ID NO:4, SEQ ID NO:8, SEQ ID NO:ll-12, SEQ ID NO:16, and SEQ ID 
NO:21-22, 

h) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22, and 

25 i) an immunogenic fragment of a polypeptide having an amino acid sequence selected 

from the group consisting of SEQ ID NO:l-22. 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-22. 

30 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 
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4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from 
the group consisting of SEQ ED NO:23-44. 

5 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 



10 



7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 



9. A method of producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein 
15 said cell is transformed with a recombinant polynucleotide, and said recombinant 

polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

20 10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence selected 

from the group consisting of SEQ ID NO:l-22. 

1 1 . An isolated antibody which specifically binds to a polypeptide of claim 1 . 

25 12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:23-44, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ 

30 ID NO:23-28, SEQ ID NO:3 1-34, SEQ ID NO:36-39, and SEQ ID NO:41, 

c) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
93% identical to the polynucleotide sequence of SEQ ID NO:40, 
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d) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
94% identical to the polynucleotide sequence of SEQ ID NO:30, 

e) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
96% identical to the polynucleotide sequence of SEQ ID NO:44, 

f) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
97% identical to the polynucleotide sequence of SEQ ID NO:35, 

g) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
99% identical to the polynucleotide sequence of SEQ ID NO:43, 

h) a polynucleotide consisting essentially of a naturally occurring polynucleotide 
sequence at least 90% identical to a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:29, and SEQ ID NO:42 

i) a polynucleotide complementary to a polynucleotide of a), 
j) a polynucleotide complementary to a polynucleotide of b), 
k) a polynucleotide complementary to a polynucleotide of c), 
1) a polynucleotide complementary to a polynucleotide of d), 
m) a polynucleotide complementary to a polynucleotide of e), 
n) a polynucleotide complementary to a polynucleotide of f), 
o) a polynucleotide complementary to a polynucleotide of g), 

p) a polynucleotide complementary to a polynucleotide of h), and 
q) an RNA equivalent of a>p). 



13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 

14, A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target 
polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
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present, the amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutical^ acceptable 
excipient. 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ED NO:l-22. 

19. A method for treating a disease or condition associated with decreased expression of 
functional NTRAN, comprising administering to a patient in need of such treatment the composition of 
claim 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaceutically acceptable excipient. 

22. A method for treating a disease or condition associated with decreased expression of 
functional NTRAN, comprising administering to a patient in need of such treatment a composition of 
claim 21. 
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23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

5 

24. A composition comprising an antagonist compound identified by a method of claim 23 and 
a pharmaceutical^ acceptable excipient 

25. A method for treating a disease or condition associated with overexpression of functional 
10 NTRAN, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
15 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

27. A method of screening for a compound that modulates the activity of the polypeptide of 
20 claim 1, the method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test 
compound, and 

25 c) comparing the activity of the polypeptide of claim 1 in the presence of the test 

compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the activity 
of the polypeptide of claim 1. 

30 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
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comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in the presence of varying 
amounts of the compound and in the absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with the test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in the amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 

30. A method for a diagnostic test for a condition or disease associated with the expression of 
NTRAN in a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 11, under conditions suitable 
for the antibody to bind the polypeptide and form an antibody:polypeptide complex, 
and 

b) detecting the complex, wherein the presence of the complex correlates with the 
presence of the polypeptide in the biological sample. 

3 1. The antibody of claim 11, wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab') 2 fragment, or 
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e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient. 

5 33. A method of diagnosing a condition or disease associated with the expression of NTRAN 

in a subject, comprising administering to said subject an effective amount of the composition of claim 
32. 

34. A composition of claim 32, further comprising a label. 

10 

35. A method of diagnosing a condition or disease associated with the expression of NTRAN 
in a subject, comprising administering to said subject an effective amount of the composition of claim 
34. 

15 36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 

11, the method comprising: 

a) immunizin g an animal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-22, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 
20 b) isolating antibodies from the animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which specifically binds to a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-22. 

25 37. A polyclonal antibody produced by a method of claim 36. 

38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
30 1 1 , the method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO:l-22, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 
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b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
antibody-producing hybridoma cells, 

d) culturing the hybridoma cells, and 

5 e) isolating from the culture monoclonal antibody which specifically binds to a 

polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQE)NO:l-22. 

40. A monoclonal antibody produced by a method of claim 39. 

10 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

42. The antibody of claim 11, wherein the antibody is produced by screening a Fab expression 

library. 

15 

43. The antibody of claim 11, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO:l-22 in a sample, the method comprising: 

a) incubating the antibody of claim 11 with the sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 

b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:l-22 in the sample. 



20 



25 



45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-22 from a sample, the method comprising: 

a) incubating the antibody of claim 11 with the sample under conditions to allow specific 
30 binding of the antibody and the polypeptide, and 

b) separating the antibody from the sample and obtaining the purified polypeptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-22. 
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46. A microarray wherein at least one element of the microarray is a polynucleotide of claim 

13. 

47. A method of generating an expression profile of a sample which contains polynucleotides, 
the method comprising: 

a) labeling the polynucleotides of the sample, 

h) contacting the elements of the microarray of claim 46 with the laheled polynucleotides 
of the sample under conditions suitable for the formation of a hybridization complex, 
and 

c) quantifying the expression of the polynucleotides in the sample. 

48. An array comprising different nucleotide molecules affixed in distinct physical locations 
on a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide 
or polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
polynucleotide, and wherein said target polynucleotide is a polynucleotide of claim 12. 

49. An airay of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 30 contiguous nucleotides of said target polynucleotide. 

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 

52. An array of claim 48, which is a microarray. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 
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55. An array of claim 48, wherein each distinct physical location on the substrate contains 
multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 
location have the same sequence, and each distinct physical location on the substrate contains 
nucleotide molecules having a sequence which differs from the sequence of nucleotide molecules at 

5 another distinct physical location on the substrate. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

10 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 
15 60. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

20 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 
25 65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:ll. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

30 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:14. 
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70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:15. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:18. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:19. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:23. 

79. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:24. 

80. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:25. 

81. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:26. 

82. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:27. 

83. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:28. 

84. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:29. 

85. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:30. 

86. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:31. 
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87. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:32. 

88. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:33. 

89. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:34. 

90. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:35. 

91. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:36. 

92. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:37. 

93. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:38. 

94. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:39. 

95. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:40. 

96. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:41. 

97. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:42. 

98. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:43. 

99. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID NO:44. 
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<110> INCYTE GENOMICS , INC. 
BAUGHN, Mariah R. 
BHATIA, Umesh G . 
BLAKE, Julie J. 
BURRILL, John D. 
ELLIOT, Vicki S. 
EMERLING, Brooke M. 
FORSYTHE , Ian J. 
GIETZEN, Kimberly J. 
GORVAD, Ann E. 
GRIFFIN, Jennifer A. 
HAFALIA, April J. A, 
HO, Anne 
JACKSON, Alan A. 
JIANG, Xin 
KABLE, Amy E . 
KEARNEY Liam 
KHARE, Reena 
LEE, Ernestine A. 
LEE, Sally 
LU, Dyung Aina M. 
MARQUIS, Joseph P. 
LEHR-MASON, Patricia M. 
RAMKUMAR, Jayalaxmi 
RICHARDSON Thomas W. 
S PRAGUE, William W. 
TRAN, Uyen K . 
CHAWLA, Narinder K. 
WARREN, Bridget A. 
YUE, Henry 
ZHENG, Wen j in 

<120> NEUROTRANSMISSION- ASSOCIATED PROTEINS 

<130> PF-1331 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> US 60/340,798 
<151> 2001-12-14 

<150> US 60/365,645 
<151> 2002-03-18 

<150> US 60/367,662 
<151> 2002-03-25 

<150> US 60/379,887 
<151> 2002-05-10 

<150> US 60/384,639 
<151> 2002-05-31 

<160> 44 

<170> PERL Program 
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<210> 1 
<211> 151 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7506758CD1 

<400> 1 

Met Ala Gin Glu lie Asp Leu Ser Ala Leu Lys Glu Leu Glu Arg 
15 10 15 

Glu Ala He Leu Gin Val Leu Tyr Arg Asp Gin Ala Val Gin Asn 
20 25 30 

Thr Glu Glu Glu Arg Thr Arg Lys Leu Lys Thr His Leu Gin His 
35 40 45 

Leu Arg Trp Lys Gly Ala Lys Asn Thr Asp Trp Glu His Lys Glu 
50 55 60 

Lys Cys Cys Ala Arg Cys Gin Gin Val Leu Gly Phe Leu Leu His 
65 70 75 

Arg Gly Ala Val Cys Arg Gly Cys Ser His Arg Val Cys Ala Gin 
80 85 90 

Cys Arg Val Phe Leu Arg Gly Thr His Ala Trp Lys Cys Thr Val 
95 100 105 

Cys Phe Glu Asp Arg Asn Val Lys He Lys Thr Gly Glu Trp Phe 
110 115 120 

Tyr Glu Glu Arg Ala Lys Lys Phe Pro Thr Gly Glu Gly Gin Cys 
125 130 135 

Thr Arg Tyr Ser Glu Thr Ser Gin Ser Arg Gly Ser Gin Met Leu 
140 145 150 

Tyr 



<210> 2 
<211> 2507 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 4381669CD1 

<400> 2 

Met Asp Ser Val Phe Ser He Tyr Phe Ser Met His Phe Gin Met 
15 10 15 

Arg He Leu Gly His Asn Ala Tyr Ala Leu Gin Gin Ala Ala Phe 

20 25 30 

Val Cys Leu Gin Leu Tyr Ser Arg Pro Asn Arg Leu Leu Ser Gly 

35 40 45 

Val Pro Leu Tyr Lys Gly Asp Val Pro Thr Gin Asp Lys Ser Ala 

50 55 60 

Val Thr Ser Leu Leu Asp Gly Leu Asn Gin Ala Phe Glu Glu Val 

65 70 75 

Ser Ser Gin Ser Gly Gly Ala Lys Arg Gin Ser He His Leu Glu 

80 85 90 

Gin Lys Leu Tyr Asp Gly Val Ser Ala Thr Ser Thr Trp Leu Asp 
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95 100 105 

Asp Val Glu Glu Arg Leu Phe Val Ala Thr Ala Leu Leu Pro Glu 
110 115 120 

Glu Thr Glu Thr Cys Leu Phe Asn Gin Glu He Leu Ala Lys Asp 
125 130 135 

He Lys Glu Met Ser Glu Glu Met Asp Lys Asn Lys Asn Leu Phe 
140 145 150 

Ser Gin Ala Phe Pro Glu Asn Gly Asp Asn Arg Asp Val He Glu 
155 160 165 

Asp Thr Leu Gly Cys Leu Leu Gly Arg Leu Ser Leu Leu Asp Ser 
170 175 180 

Val Val Asn Gin Arg Cys His Gin Met Lys Glu Arg Leu Gin Gin 
185 190 195 

He Leu Asn Phe Gin Asn Asp Leu Lys Val Leu Phe Thr Ser Leu 
200 205 210 

Ala Asp Asn Lys Tyr" He He Leu Gin Lys Leu Ala Asn Val Phe 
215 220 225 

Glu Gin Pro Val Ala Glu Gin He Glu Ala He Gin Gin Ala Glu 
230 235 240 

Asp Gly Leu Lys Glu Phe Asp Ala Gly He He Glu Leu Lys Arg 
245 250 255 

Arg Gly Asp Lys Leu Gin Val Glu Gin Pro Ser Met Gin Glu Leu 
260 265 270 

Ser Lys Leu Gin Asp Met Tyr Asp Glu Leu Met Met He He Gly 
275 280 285 

Ser Arg Arg Ser Gly Leu Asn Gin Asn Leu Thr Leu Lys Ser Gin 
290 295 300 

Tyr Glu Arg Ala Leu Gin Asp Leu Ala Asp Leu Leu Glu Thr Gly 
305 310 315 

Gin Glu Lys Met Ala Gly Asp Gin Lys He He Val Ser Ser Lys 
320 325 330 

Glu Glu He Gin Gin Leu Leu Asp Lys His Lys Glu Tyr Phe Gin 
335 340 345 

Gly Leu Glu Ser His Met He Leu Thr Glu Thr Leu Phe Arg Lys 
350 355 360 

He He Ser Phe Ala Val Gin Lys Glu Thr Gin Phe His Thr Glu 
365 370 375 

Leu Met Ala Gin Ala Ser Ala Val Leu Lys Arg Ala His Lys Arg 
380 385 390 

Gly Val Glu Leu Glu Tyr He Leu Glu Thr Trp Ser His Leu Asp 
395 400 405 

Glu Asp Gin Gin Glu Leu Ser Arg Gin Leu Glu Val Val Glu Ser 
410 415 . 420 

Ser He Pro Ser Val Gly Leu Val Glu Glu Asn Glu Asp Arg Leu 
425 430 435 

He Asp Arg He Thr Leu Tyr Gin His Leu Lys Ser Ser Leu Asn 
440 445 450 

Glu Tyr Gin Pro Lys Leu Tyr Gin Val Leu Asp Asp Gly Lys Arg 
455 460 465 

Leu Leu He Ser He Ser Cys Ser Asp Leu Glu Ser Gin Leu Asn 
470 475 480 

Gin Leu Gly Glu Cys Trp Leu Ser Asn Thr Asn Lys Met Ser Lys 
485 490 495 

Glu Leu His Arg Leu Glu Thr He Leu Lys His Trp Thr Arg Tyr 
500 505 510 

Gin Ser Glu Ser Ala Asp Leu He His Trp Leu Gin Ser Ala Lys 
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515 520 525 

Asp Arg Leu Glu Phe Trp Thr Gin Gin Ser Val Thr Val Pro Gin 
530 535 540 

Glu Leu Glu Met Val Arg Asp His Leu Asn Ala Phe Leu Glu Phe 
545 550 555 

Ser Lys Glu Val Asp Ala Gin Ser Ser Leu Lys Ser Ser Val Leu 
560 565 570 

Ser Thr Gly Asn Gin Leu Leu Arg Leu Lys Lys Val Asp Thr Ala 
575 580 585 

Thr Leu Arg Ser Glu Leu Ser Arg He Asp Ser Gin Trp Thr Asp 
590 595 600 

Leu Leu Thr Asn He Pro Ala Val Gin Glu Lys Leu His Gin Leu 
605 610 615 

Gin Met Asp Lys Leu Pro Ser Arg His Ala He Ser Glu Val Met 
620 625 630 

Ser Trp He Ser Leu Met Glu Asn Val He Gin Lys Asp Glu Asp 
' 635 640 645 

Asn He Lys Asn Ser He Gly Tyr Lys Ala He His Glu Tyr Leu 
650 655 660 

Gin Lys Tyr Lys Gly Phe Lys He Asp He Asn Cys Lys Gin Leu 
665 670 675 

Thr Val Asp Phe Val Asn Gin Ser Val Leu Gin He Ser Ser Gin 
680 685 690 

Asp Val Glu Ser Lys Arg Ser Asp Lys Thr Asp Phe Ala Glu Gin 
695 700 705 

Leu Gly Ala Met Asn Lys Ser Trp Gin He Leu Gin Gly Leu Val 
710 715 720 

Thr Glu Lys He Gin Leu Leu Glu Gly Leu Leu Glu Ser Trp Ser 
725 730 735 

Glu Tyr Glu Asn Asn Val Gin Cys Leu Lys Thr Trp Phe Glu Thr 
740 745 750 

Gin Glu Lys Arg Leu Lys Gin Gin His Arg He Gly Asp Gin Ala 
755 760 765 

Ser Val Gin Asn Ala Leu Lys Asp Cys Gin Asp Leu Glu Asp Leu 
770 775 780 

He Lys Ala Lys Glu Lys Glu Val Glu Lys He Glu Gin Asn Gly 
785 790 795 

Leu Ala Leu He Gin Asn Lys Lys Glu Asp Val Ser Ser He Val 
800 805 810 

Met Ser Thr Leu Arg Glu Leu Gly Gin Thr Trp Ala Asn Leu Asp 
815 820 825 

His Met Val Gly Gin Leu Lys He Leu Leu Lys Ser Val Leu Asp 
830 835 840 

Gin Trp Ser Ser His Lys Val Ala Phe Asp Lys He Asn Ser Tyr 
845 850 855 

Leu Met Glu Ala Arg Tyr Ser Leu Ser Arg Phe Arg Leu Leu Thr 
860 865 870 

Gly Ser Leu Glu Ala Val Gin Val Gin Val Asp Asn Leu Gin Asn 
875 880 885 

Leu Gin Asp Asp Leu Glu Lys Gin Glu Arg Ser Leu Gin Lys Phe 
890 895 900 

Gly Ser He Thr Asn Gin Leu Leu Lys Glu Cys His Pro Pro Val 
905 910 915 

Thr Glu Thr Leu Thr Asn Thr Leu Lys Glu Val Asn Met Arg Trp 
920 925 930 

Asn Asn Leu Leu Glu Glu He Ala Glu Gin Leu Gin Ser Ser Lys 



4/65 



WO 03/051902 



PCT/US02/40059 



935 940 945 

Ala Leu Leu Gin Leu Trp Gin Arg Tyr Lys Asp Tyr Ser Lys Gin 
950 955 960 

Cys Ala Ser Thr Val Gin Gin Gin Glu Asp Arg Thr Asn Glu Leu 
965 970 975 

Leu Lys Ala Ala Thr Asn Lys Asp He Ala Asp Asp Glu Val Ala 
980 985 990 

Thr Trp He Gin Asp Cys Asn Asp Leu Leu Lys Gly Leu Gly Thr 
995 1000 1005 

Val Lys Asp Ser Leu Phe Phe Leu His Glu Leu Gly Glu Gin Leu 
1010 1015 1020 

Lys Gin Gin Val Asp Ala Ser Ala Ala Ser Ala He Gin Ser Asp 
1025 1030 1035 

Gin Leu Ser Leu Ser Gin His Leu Cys Ala Leu Glu Gin Ala Leu 
1040 1045 1050 

Cys Lys Gin Gin Thr Ser Leu Gin Ala Gly Val Leu Asp Tyr Glu 
1055 1060 1065 

Thr Phe Ala Lys Ser Leu Glu Ala Leu Glu Ala Trp He Val Glu 
1070 1075 1080 

Ala Glu Glu He Leu Gin Gly Gin Asp Pro Ser His Ser Ser Asp 
1085 * 1090 1095 

Leu Ser Thr He Gin Glu Arg Met Glu Glu Leu Lys Gly Gin Met 
1100 1105 1110 

Leu Lys Phe Ser Ser Met Ala Pro Asp Leu Asp Arg Leu Asn Glu 
1115 1120 1125 

Leu Gly Tyr Arg Leu Pro Leu Asn Asp Lys Glu He Lys Arg Met 
1130 1135 1140 

Gin Asn Leu Asn Arg His Trp Ser Leu lie Ser Ser Gin Thr Thr 
1145 1150 1155 

Glu Arg Phe Ser Lys Leu Gin Ser Phe Leu Leu Gin His Gin Thr 
1160 1165 1170 

Phe Leu Glu Lys Cys Glu Thr Trp Met Glu Phe Leu Val Gin Thr 
1175 1180 1185 

Glu Gin Lys Leu Ala Val Glu He Ser Gly Asn Tyr Gin His Leu 
1190 1195 1200 

Leu Glu Gin Gin Arg Ala His Glu Val Leu Phe Gin Ala Glu Met 
1205 1210 1215 

Phe Ser Arg Gin Gin He Leu His Ser He He He Asp Gly Gin 
1220 , 1225 1230 

Arg Leu Leu Glu Gin Gly Gin Val Asp Asp Arg Asp Glu Phe Asn 
1235 1240 1245 

Leu Lys Leu Thr Leu Leu Ser Asn Gin Trp Gin Gly Val He Arg 
1250 1255 1260 

Arg Ala Gin Gin Arg Arg Gly He He Asp Ser Gin He Arg Gin 
1265 1270 1275 

Trp Gin Arg Tyr Arg Glu Met Ala Glu Lys Leu Arg Lys Trp Leu 
1280 1285 1290 

Val Glu Val Ser Tyr Leu Pro Met Ser Gly Leu Gly Ser Val Pro 
1295 1300 1305 

He Pro Leu Gin Gin Ala Arg Thr Leu Phe Asp Glu Val Gin Phe 
1310 1315 1320 

Lys Glu Lys Val Phe Leu Arg Gin Gin Gly Ser Tyr He Leu Thr 
1325 1330 1335 

Val Glu Ala Gly Lys Gin Leu Leu Leu Ser Ala Asp Ser Gly Ala 
1340 1345 1350 

Glu Ala Ala Leu Gin Ala Glu Leu Ala Glu He Gin Glu Lys Trp 
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1355 1360 1365 

Lys Ser Ala Ser Met Arg Leu Glu Glu Gin Lys Lys Lys Leu Ala 
1370 1375 1380 

Phe Leu Leu Lys Asp Trp Glu Lys Cys Glu Lys Gly lie Ala Asp 
1385 1390 1395 

Ser Leu Glu Lys Leu Arg Thr Phe Lys Lys Lys Leu Ser Gin Ser 
1400 1405 1410 

Leu Pro Asp His His Glu Glu Leu His Ala Glu Gin Met Arg Cys 
1415 1420 1425 

Lys Glu Leu Glu Asn Ala Val Gly Ser Trp Thr Asp Asp Leu Thr 
1430 1435 1440 

Gin Leu Ser Leu Leu Lys Asp Thr Leu Ser Ala Tyr lie Ser Ala 
1445 1450 1455 

Asp Asp He Ser He Leu Asn Glu Arg Val Glu Leu Leu Gin Arg 
1460 1465 1470 

Gin Trp Glu Glu Leu Cys His Gin Leu Ser Leu Arg Arg Gin Gin 
1475 1480 1485 

He Gly Glu Arg Leu Asn Glu Trp Ala Val Phe Ser Glu Lys Asn 
1490 1495 1500 

Lys Glu Leu Cys Glu Trp Leu Thr Gin Met Glu Ser Lys Val Ser 
1505 1510 1515 

Gin Asn Gly Asp He Leu He Glu Glu Met He Glu Lys Leu Lys 
1520 1525 1530 

Lys Asp Tyr Gin Glu Glu He Ala He Ala Gin Glu Asn Lys He 
1535 1540 1545 

Gin Leu Gin Gin Met Gly Glu Arg Leu Ala Lys Ala Ser His Glu 
1550 1555 1560 

Ser Lys Ala Ser Glu He Glu Tyr Lys Leu Gly Lys Val Asn Asp 
1565 1570 1575 

Arg Trp Gin His Leu Leu Asp Leu He Ala Ala Arg Val Lys Lys 
1580 1585 1590 

Leu Lys Glu Thr Leu Val Ala Val Gin Gin Leu Asp Lys Asn Met 
1595 1600 1605 

Ser Ser Leu Arg Thr Trp Leu Ala His He Glu Ser Glu Leu Ala 
1610 1615 1620 

Lys Pro He Val Tyr Asp Ser Cys Asn Ser Glu Glu He Gin Arg 
1625 1630 1635 

Lys Leu Asn Glu Gin Gin Glu Leu Gin Arg Asp He Glu Lys His 
1640 1645 1650 

Ser Thr Gly Val Ala Ser Val Leu Asn Leu Cys Glu Val Leu Leu 
1655 1660 1665 

His Asp Cys Asp Ala Cys Ala Thr Asp Ala Glu Cys Asp Ser He 
1670 1675 1680 

Gin Gin Ala Thr Arg Asn Leu Asp Arg Arg Trp Arg Asn He Cys 
1685 1690 1695 

Ala Met Ser Met Glu Arg Arg Leu Lys He Glu Glu Thr Trp Arg 
1700 1705 1710 

Leu Trp Gin Lys Phe Leu Asp Asp Tyr Ser Arg Phe Glu Asp Trp 
1715 1720 1725 

Leu Lys Ser Ser Glu Arg Thr Ala Ala Phe Pro Ser Ser Ser Gly 
1730 1735 1740 

Val He Tyr Thr Val Ala Lys Glu Glu Leu Lys Lys Phe Glu Ala 
1745 1750 1755 

Phe Gin Arg Gin Val His Glu Cys Leu Thr Gin Leu Glu Leu He 
1760 1765 1770 

Asn Lys Gin Tyr Arg Arg Leu Ala Arg Glu Asn Arg Thr Asp Ser 
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1775 1780 1785 

Ala Cys Ser Leu Lys Gin Met Val His Glu Gly Asn Gin Arg Trp 
1790 1795 1800 

Asp Asn Leu Gin Lys Arg Val Thr Ser He Leu Arg Arg Leu Lys 
1805 1810 1815 

His Phe He Gly Gin Arg Glu Glu Phe Glu Thr Ala Arg Asp Ser 
1820 1825 1830 

He Leu Val Trp Leu Thr Glu Met Asp Leu Gin Leu Thr Asn He 
1835 1840 1845 

Glu His Phe Ser Glu Cys Asp Val Gin Ala Lys He Lys Gin Leu 
1850 1855 1860 

Lys Ala Phe Gin Gin Glu He Ser Leu Asn His Asn Lys He Glu 
1865 1870 1875 

Gin He He Ala Gin Gly Glu Gin Leu He Glu Lys Ser Glu Pro 
1880 1885 1890 

Leu Asp Ala Ala He He Glu Glu Glu Leu Asp Glu Leu Arg Arg 
1895 1900 1905 

Tyr Cys Gin Glu Val Phe Gly Arg Val Glu Arg Tyr His Lys Lys 
1910 , 1915 1920 

Leu He Arg Leu Pro Val Tyr Asp Glu His Asp Leu Ser Asp Arg 
1925 1930 1935 

Glu Leu Glu Leu Glu Asp Ser Ala Ala Leu Ser Asp Leu His Trp 
1940 1945 1950 

His Asp Arg Ser Ala Asp Ser Leu Leu Ser Pro Gin Pro Ser Ser 
1955 1960 1965 

Asn Leu Ser Leu Ser Leu Ala Gin Pro Leu Arg Ser Glu Arg Ser 
1970 1975 1980 

Gly Arg Asp Thr Pro Ala Ser Val Asp Ser He Pro Leu Glu Trp 
1985 1990 1995 

Asp His Asp Tyr Asp Leu Ser Arg Asp Leu Glu Ser Ala Met Ser 
2000 2005 2010 

Arg Ala Leu Pro Ser Glu Asp Glu Glu Gly Gin Asp Asp Lys Asp 
2015 2020 2025 

Phe Tyr Leu Arg Gly Ala Val Gly Leu Ser Gly Asp His Ser Ala 
2030 2035 2040 

Leu Glu Ser Gin He Arg Gin Leu Gly Lys Ala Leu Asp Asp Ser 
2045 2050 2055 

Arg Phe Gin He Gin Gin Thr Glu Asn He He Arg Ser Lys Thr 
2060 2065 2070 

Pro Thr Gly Pro Glu Leu Asp Thr Ser Tyr Lys Gly Tyr Met Lys 
2075 2080 2085 

Leu Leu Gly Glu Cys Ser Ser Ser He Asp Ser Val Lys Arg Leu 
2090 2095 2100 

Glu His Lys Leu Lys Glu Glu Glu Glu Ser Leu Pro Gly Phe Val 
2105 2110 2115 

Asn Leu His Ser Thr Glu Thr Gin Thr Ala Gly Val He Asp Arg 
2120 2125 2130 

Trp Glu Leu Leu Gin Ala Gin Ala Leu Ser Lys Glu Leu Arg Met 
2135 2140 2145 

Lys Gin Asn Leu Gin Lys Trp Gin Gin Phe Asn Ser Asp Leu Asn 
2150 2155 2160 

Ser He Trp Ala Trp Leu Gly Asp Thr Glu Glu Glu Leu Glu Gin 
2165 2170 2175 

Leu Gin Arg Leu Glu Leu Ser Thr Asp He Gin Thr He Glu Leu 
2180 2185 2190 

Gin He Lys Lys Leu Lys Glu Leu Gin Lys Ala Val Asp His Arg 
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2195 2200 2205 

Lys Ala He He Leu Ser He Asn Leu Cys Ser Pro Glu Phe Thr 

2210 2215 2220 

Gin Ala Asp Ser Lys Glu Ser Arg Asp Leu Gin Asp Arg Leu Ser 

2225 2230 2235 

Gin Met Asn Gly Arg Trp Asp Arg Val Cys Ser Leu Leu Glu Glu 

2240 2245 2250 

Trp Arg Gly Leu Leu Gin Asp Ala Leu Met Gin Cys Gin Gly Phe 

2255 2260 2265 

His Glu Met Ser His Gly Leu Leu Leu Met Leu Glu Asn He Asp 

2270 2275 2280 

Arg Arg Lys Asn Glu He Val Pro He Asp Ser Asn Leu Asp Ala 

2285 2290 2295 

Glu He Leu Gin Asp His His Lys Gin Leu Met Gin He Lys His 

2300 2305 2310 

Glu Leu Leu Glu Ser Gin Leu Arg Val Ala Ser Leu Gin Asp Met 

2315 2320 2325 

Ser Cys Gin Leu Leu Val Asn Ala Glu Gly Thr Asp Cys Leu Glu 

2330 2335 2340 

Ala Lys Glu Lys Val His Val He Gly Asn Arg Leu Lys Leu Leu 

2345 2350 2355 

Leu Lys Glu Val Ser Arg His He Lys Glu Leu Glu Lys Leu Leu 

2360 2365 2370 

Asp Val Ser Ser Ser Gin Gin Asp Leu Ser Ser Trp Ser Ser Ala 

2375 2380 2385 

Asp Glu Leu Asp Thr Ser Gly Ser Val Ser Pro Thr Ser Gly Arg 

2390 2395 2400 

Ser Thr Pro Asn Arg Gin Lys Thr Pro Arg Gly Lys Cys Ser Leu 

2405 2410 2415 

Ser Gin Pro Gly Pro Ser Val Ser Ser Pro His Ser Arg Ser Thr 

2420 2425 2430 

Lys Gly Gly Ser Asp Ser Ser Leu Ser Glu Pro Gly Pro Gly Arg 

2435 2440 2445 

Ser Gly Arg Gly Phe Leu Phe Arg Val Leu Arg Ala Ala Leu Pro 

2450 2455 2460 

Leu Gin Leu Leu Leu Leu Leu Leu He Gly Leu Ala Cys Leu Val 

2465 2470 2475 

Pro Met Ser Glu Glu Asp Tyr Ser Cys Ala Leu Ser Asn Asn Phe 
2480 2485 2490 

Ala Arg Ser Phe His Pro Met Leu Arg Tyr Thr Asn Gly Pro Pro 
2495 2500 2505 

Pro Leu 



<210> 3 

<211> 375 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7503554CD1 

<400> 3 

Met Lys Asp Met Pro Leu Arg He His Val Leu Leu Gly Leu Ala 
15 10 15 
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He Thr Thr Leu Val Gin Ala Val Asp Lys Lys Val Asp Cys Pro 
20 25 30 

Arg Leu Cys Thr Cys Glu He Arg Pro Trp Phe Thr Pro Arg Ser 
35 40 45 

He Tyr Met Glu Ala Ser Thr Val Asp Cys Asn Asp Leu Gly Leu 
50 55 60 

Leu Thr Phe Pro Ala Arg Leu Pro Ala Asn Thr Gin He Leu Leu 
65 70 75 

Leu Gin Thr Asn Asn He Ala Lys He Glu Tyr Ser Thr Asp Phe 
80 85 90 

Pro Val Asn Leu Thr Gly Leu Asp Leu Ser Gin Asn Asn Leu Ser 
95 100 105 

Ser Val Thr Asn He Asn Val Lys Lys Met Pro Gin Leu Leu Ser 
110 115 120 

Val Tyr Leu Glu Glu Asn Lys Leu Thr Glu Leu Pro Glu Lys Cys 
125 130 135 

Leu Ser Glu Leu Ser Asn Leu Gin Glu Leu Tyr He Asn His Asn 
140 145 150 

Leu Leu Ser Thr He Ser Pro Gly Ala Phe He Gly Leu His Asn 
155 160 165 

Leu Leu Arg Leu His Leu Asn Ser Asn Arg Leu Gin Met He Asn 
170 175 180 

Ser Lys Trp Phe Asp Ala Leu Pro Asn Leu Glu He Leu Met He 
185 190 195 

Gly Glu Asn Pro He He Arg He Lys Asp Met Asn Phe Lys Pro 
200 205 210 

Leu He Asn Leu Arg Ser Leu Val He Ala Gly He Asn Leu Thr 
215 220 225 

Glu He Pro Asp Asn Ala Leu Val Gly Leu Glu Asn Leu Glu Ser 
230 235 240 

He Ser Phe Tyr Asp Asn Arg Leu He Lys Val Pro His Val Ala 
245 250 255 

Leu Gin Lys Val Val Asn Leu Lys Phe Leu Asp Leu Asn Lys Asn 
260 265 270 

Pro He Asn Arg He Arg Arg Gly Asp Phe Ser Asn Met Leu His 
275 280 285 

Leu Lys Glu Leu Gly He Asn Asn Met Pro Glu Leu He Ser He 
290 295 300 

Asp Ser Leu Ala Val Asp Asn Leu Pro Asp Leu Arg Lys He Glu 
305 310 315 

Ala Thr Asn Asn Pro Arg Leu Ser Tyr He His Pro Asn Ala Phe 
320 325 330 

Phe Arg Leu Pro Lys Leu Glu Ser Leu Met Leu Asn Ser Asn Ala 
335 340 345 

Leu Ser Ala Leu Cys His Pro Leu Asp Glu His Glu Gin Asn Gin 
350 355 360 

His Ser He His Gly Ala Arg Phe Thr Val Leu Arg Gly Pro Thr 
365 370 375 



<210> 4 

<211> 340 

<212> PRT 

<213> Homo sapiens 

<220> 
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<221> misc_feature 

<223> Incyte ID No: 7506077CD1 

<400> 4 

Met Gly Gly Cys Ala Gly Ser Arg Arg Arg Phe Ser Asp Ser Glu 
15 10 15 

Gly Glu Glu Thr Val Pro Glu Pro Arg Leu Pro Leu Leu Asp His 

20 25 30 

Gin Gly Ala His Trp Lys Asn Ala Val Gly Phe Trp Leu Leu Gly 

35 40 45 

Leu Cys Asn Asn Phe Ser Tyr Val Val Met Leu Ser Ala Ala His 
50 55 60 

Asp lie Leu Ser His Lys Arg Thr Ser Gly Asn Gin Ser His Val 
65 70 75 

Asp Pro Gly Pro Thr Pro lie Pro His Asn Ser Ser Ser Arg Phe 
80 85 90 

Asp Cys Asn Ser Val Ser Thr Ala Leu Ser Tyr Leu Gly Leu Thr 
95 100 105 

Gin Ala Gly Leu Ser Pro Gin Gin Thr Leu Leu Ser Met Leu Gly 

110 115 120 

lie Pro Ala Leu Leu Leu Ala Ser Tyr Phe Leu Leu Leu Thr Ser 

125 130 135 

Pro Glu Ala Gin Asp Pro Gly Gly Glu Glu Glu Ala Glu Ser Ala 

140 145 150 

Ala Arg Gin Pro Leu lie Arg Thr Glu Ala Pro Glu Ser Lys Pro 

155 160 165 

Gly Ser Ser Ser Ser Leu Ser Leu Arg Glu Arg Trp Thr Val Phe 

170 175 180 

Lys Gly Leu Leu Trp Tyr lie Val Pro Leu Val Val Val Tyr Phe 

185 190 195 

Ala Glu Tyr Phe He Asn Gin Gly Leu Phe Glu Leu Leu Phe Phe 

200 205 210 

Trp Asn Thr Ser Leu Ser His Ala Gin Gin Tyr Arg Trp Tyr Gin 

215 220 225 

Met Leu Tyr Gin Ala Gly Val Phe Ala Ser Arg Ser Ser Leu Arg 

230 235 240 

Cys Cys Arg He Arg Phe Thr Trp Ala Leu Ala Leu Leu Gin Cys 

245 250 255 

Leu Asn Leu Val Phe Leu Leu Ala Asp Val Trp Phe Gly Phe Leu 

260 265 270 

Pro Ser He Tyr Leu Val Phe Leu He He Leu Tyr Glu Gly Leu 

275 280 285 

Leu Gly Gly Ala Ala Tyr Val Asn Thr Phe His Asn He Ala Leu 

290 295 300 

Glu Thr Ser Asp Glu His Arg Glu Phe Ala Met Ala Ala Thr Cys 

305 310 315 

He Ser Asp Thr Leu Gly He Ser Leu Ser Gly Leu Leu Ala Leu 

320 325 330 

Pro Leu His Asp Phe Leu Cys Gin Leu Ser 

335 340 

<210> 5 

<211> 287 

<212> PRT 

<213> Homo sapiens 



10/65 



WO 03/051902 



PCT/US02/40059 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 7506765CD1 

<400> 5 

Met Ala Leu Gly Leu Leu lie Ala Val Pro Leu Leu Leu Gin Ala 
15 10 15 

Ala Pro Arg Gly Ala Ala His Tyr Glu Met Met Gly Thr Cys Arg 
20 25 30 

Met He Cys Asp Pro Tyr Thr Ala Ala Pro Gly Gly Glu Pro Pro 
35 40 45 

Gly Ala Lys Ala Gin Pro Pro Gly Pro Ser Thr Ala Ala Leu Glu 
50 55 60 

Val Met Gin Asp Leu Ser Ala Asn Pro Pro Pro Pro Phe He Gin 
65 70 75 

Gly Pro Lys Gly Asp Pro Gly Arg Pro Gly Lys Pro Gly Pro Arg 
80 85 90 

Gly Pro Pro Gly Glu Pro Gly Pro Pro Gly Pro Arg Gly Pro Pro 
95 100 105 

Gly Glu Lys Gly Asp Ser Gly Arg Pro Gly Leu Pro Gly Leu Gin 
110 115 120 

Leu Thr Ala Gly Thr Ala Ser Gly Val Gly Val Val Gly Gly Gly 
125 130 135 

Ala Gly Val Gly Gly Asp Ser Glu Gly Glu Val Thr Ser Ala Leu 
140 145 150 

Ser Ala Thr Phe Ser Gly Pro Lys He Ala Phe Tyr Val Gly Leu 
155 160 165 

Lys Ser Pro His Glu Gly Tyr Glu Val Leu Lys Phe Asp Asp Val 
170 175 180 

Val Thr Asn Leu Gly Asn His Tyr Asp Pro Thr Thr Gly Lys Phe 
185 190 195 

Ser Cys Gin Val Arg Gly He Tyr Phe Phe Thr Tyr His He Leu 
200 205 210 

Met Arg Gly Gly Asp Gly Thr Ser Met Trp Ala Asp Leu Cys Lys 
215 220 225 

Asn Gly Gin Val Arg Ala Ser Ala He Ala Gin Asp Ala Asp Gin 
230 235 240 

Asn Tyr Asp Tyr Ala Ser Asn Ser Val Val Leu His Leu Asp Ser 
245 250 255 

Gly Asp Glu Val Tyr Val Lys Leu Asp Gly Gly Lys Ala His Gly 
260 265 270 

Gly Asn Asn Asn Lys Tyr Ser Thr Phe Ser Gly Phe Leu Leu Tyr 
275 280 285 

Pro Asp 



<210> 6 
<211> 1048 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7490048CD1 

<400> 6 
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Met Leu Ala Val Ser Leu Lys Trp Arg Leu Gly Val Val Arg Arg 
15 10 15 

Arg Leu Lys Asp Asp Gly Pro Tyr Ser Lys Gly Gly Lys Asp Ala 
20 25 30 

Gly Gly Ala Asp Val Ser Leu Ala Cys Arg Arg Gin Ser He Pro 
35 40 45 

Glu Glu Phe Arg Gly He Thr Val Val Glu Leu He Lys Lys Glu 
50 55 60 

Gly Ser Thr Leu Gly Leu Thr He Ser Gly Gly Thr Asp Lys Asp 
65 70 75 

Gly Lys Pro Arg Val Ser Asn Leu Arg Pro Gly Gly Leu Ala Ala 
80 85 90 

Arg Ser Asp Leu Leu Asn He Gly Asp Tyr He Arg Ser Val Asn 
95 100 105 

Gly He His Leu Thr Arg Leu Arg His Asp Glu He He Thr Leu 
110 115 120 

Leu Lys Asn Val Gly Glu Arg Val Val Leu Glu Val Glu Tyr Glu 
125 130 135 

Leu Pro Pro Pro Ala Pro Glu Asn Asn Pro Arg He He Ser Lys 
140 145 150 

Thr Val Asp Val Ser Leu Tyr Lys Glu Gly Asn Ser Phe Gly Phe 
155 160 165 

Val Leu Arg Gly Gly Ala His Glu Asp Gly His Lys Ser Arg Pro 
170 175 180 

Leu Val Leu Thr Tyr Val Arg Pro Gly Gly Pro Ala Asp Arg Glu 
185 190 195 

Gly Ser Leu Lys Val Gly Asp Arg Leu Leu Ser Val Asp Gly He 
200 205 210 

Pro Leu His Gly Ala Ser His Ala Thr Ala Leu Ala Thr Leu Arg 
215 220 225 

Gin Cys Ser His Glu Ala Leu Phe Gin Val Glu Tyr Asp Val Ala 
230 235 240 

Thr Pro Asp Thr Val Ala Asn Ala Ser Gly Pro Leu Met Val Glu 
245 250 255 

He Val Lys Thr Pro Gly Ser Ala Leu Gly He Ser Leu Thr Thr 
260 265 270 

Thr Ser Leu Arg Asn Lys Ser Val He Thr He Asp Arg He Lys 
275 280 285 

Pro Ala Ser Val Val Asp Arg Ser Gly Ala Leu His Pro Gly Asp 
290 295 300 

His He Leu Ser He Asp Gly Thr Ser Met Glu His Cys Ser Leu 
305 310 315 

Leu Glu Ala Thr Lys Leu Leu Ala Ser He Ser Glu Lys Val Arg 
320 325 330 

Leu Glu He Leu Pro Val Pro Gin Ser Gin Arg Pro Leu Arg Pro 
335 340 345 

Ser Glu Ala Val Lys Val Gin Arg Ser Glu Gin Leu His Arg Trp 
350 355 360 

Asp Pro Cys Val Pro Ser Cys His Ser Pro Arg Pro Gly His Cys 
365 370 375 

Arg Met Pro Thr Trp Ala Thr Pro Ala Gly Gin Asp Gin Ser Arg 
380 385 390 

Ser Leu Ser Ser Thr Pro Phe Ser Ser Pro Thr Leu Asn His Ala 
395 400 405 

Phe Ser Cys Asn Asn Pro Ser Thr Leu Pro Arg Gly Ser Gin Pro 
410 415 420 
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Met Ser Pro Arg Thr Thr Met Gly Arg Arg Arg His Arg Arg Arg 
425 430 435 

Glu His Lys Thr Ser Leu Ser Leu Ala Ser Ser Thr Val Gly Pro 
440 445 450 

Gly Gly Gin lie Val His Thr Glu Thr Thr Glu Val Val Leu Cys 

455 460 465 

Gly Asp Pro Leu Ser Gly Phe Gly Leu Gin Leu Gin Gly Gly lie 

470 475 480 

Phe Ala Thr Glu Thr Leu Ser Ser Pro Pro Leu Val Cys Phe lie 

485 490 495 

Glu Pro Asp Ser Pro Ala Glu Arg Cys Gly Leu Leu Gin Val Gly 

500 505 510 

Asp Arg Val Leu Ser lie Asn Gly lie Ala Thr Glu Asp Gly Thr 

515 520 525 

Met Glu Glu Ala Asn Gin Leu Leu Arg Asp Ala Ala Leu Ala His 

530 535 540 

Lys Val Val Leu Glu Val Glu Phe Asp Val Ala Glu Ser Val He 

545 550 555 

Pro Ser Ser Gly Thr Phe His Val Lys Leu Pro Lys Lys Arg Ser 

560 565 570 

Val Glu Leu Gly He Thr He Ser Ser Ala Ser Arg Lys Arg Gly 

575 580 585 

Glu Pro Leu He He Ser Asp He Lys Lys Gly Ser Val Ala His 

590 595 600 

Arg Thr Gly Thr Leu Glu Pro Gly Asp Lys Leu Leu Ala He Asp 

605 610 615 

Asn He Arg Leu Asp Asn Cys Pro Met Glu Asp Ala Val Gin He 

620 625 630 

Leu Arg Gin Cys Glu Asp Leu Val Lys Leu Lys He Arg Lys Asp 

635 640 645 

Glu Asp Asn Ser Asp Glu Leu Glu Thr Thr Gly Ala Val Ser Tyr 

650 655 660 

Thr Val Glu Leu Lys Arg Tyr Gly Gly Pro Leu Gly He Thr He 

665 670 675 

Ser Gly Thr Glu Glu Pro Phe Asp Pro He Val He Ser Gly Leu 

680 685 690 

Thr Lys Arg Gly Leu Ala Glu Arg Thr Gly Ala He His Val Gly 

695 700 705 

Asp Arg He Leu Ala lie Asn Asn Val Ser Leu Lys Gly Arg Pro 

710 715 720 

Leu Ser Glu Ala He His Leu Leu Gin Val Ala Gly Glu Thr Val 

725 730 735 

Thr Leu Lys He Lys Lys Gin Leu Asp Arg Pro Leu Leu Pro Arg 

740 745 750 

Lys Ser Gly Ser Leu Ser Glu Thr Ser Asp Ala Asp Glu Asp Pro 

755 760 765 

Ala Asp Ala Leu Lys Gly Gly Leu Pro Ala Ala Arg Phe Ser Pro 

770 775 780 

Ala Val Pro Ser Val Asp Ser Ala Val Glu Ser Trp Asp Ser Ser 

785 790 795 

Ala Thr Glu Gly Gly Phe Gly Gly Pro Gly Ser Tyr Thr Pro Gin 

800 805 810 

Ala Ala Ala Arg Gly Thr Thr Pro Gin Glu Arg Arg Pro Gly Trp 

815 820 825 

Leu Arg Gly Ser Pro Pro Pro Thr Glu Pro Arg Arg Thr Ser Tyr 
830 835 840 
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Thr Pro Thr Pro Ala Asp Glu Ser Phe Pro Glu Glu Glu Glu Glu 
845 850 855 

Asp Asp Trp Glu Pro Pro Thr Ser Pro Ala Pro Gly Pro Ala Arg 
860 865 870 

Glu Glu Gly Phe Trp Arg Met Phe Gly Glu Ala Leu Glu Asp Leu 
875 880 885 

Glu Ser Cys Gly Gin Ser Glu Leu Leu Arg Glu Leu Glu Ala Ser 
890 895 900 

lie Met Thr Gly Thr Val Gin Arg Val Ala Leu Glu Gly Arg Pro 
905 910 915 

Gly His Arg Pro Trp Gin Arg Gly Arg Glu Val Arg Ala Ser Pro 
920 925 930 

Ala Glu Met Glu Glu Leu Leu Leu Pro Thr Pro Leu Glu Met His 
935 940 945 

Lys Val Thr Leu His Lys Asp Pro Met Arg His Asp Phe Gly Phe 
950 955 960 

Ser Val Ser Asp Gly Leu Leu Glu Lys Gly Val Tyr Val His Thr 
965 970 975 

Val Arg Pro Asp Gly Pro Ala His Arg Gly Gly Leu Gin Pro Phe 
980 985 990 

Asp Arg Val Leu Gin Val Asn His Val Arg Thr Arg Asp Phe Asp 
995 1000 1005 

Cys Cys Leu Ala Val Pro Leu Leu Ala Glu Ala Gly Asp Val Leu 
1010 1015 1020 

Glu Leu He He Ser Arg Lys Pro His Thr Ala His Ser Ser Arg 
1025 1030 1035 

Ala Pro Arg Ser Pro Gly Pro Ser Ser Pro Arg Met Leu 
1040 1045 



<210> 7 
<211> 419 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> miscLfeature 

<223> Incyte ID No: 2133585CD1 



<400> 7 

Met Gly Ser Pro Ala His Arg Pro Ala Leu Leu Leu Leu Leu Pro 
15 10 15 

Pro Leu Leu Leu Leu Leu Leu Leu Arg Val Pro Pro Ser Arg Ser 

20 25 30 

Phe Pro Gly Ser Gly Asp Ser Pro Leu Glu Asp Asp Glu Val Gly 

35 40 45 

Tyr Ser His Pro Arg Tyr Lys Asp Thr Pro Trp Cys Ser Pro He 

50 55 60 

Lys Val Lys Tyr Gly Asp Val Tyr Cys Arg Ala Pro Gin Gly Gly 

65 70 75 

Tyr Tyr Lys Thr Ala Leu Gly Thr Arg Cys Asp He Arg Cys Gin 

80 85 90 

Lys Gly Tyr Glu Leu His Gly Ser Ser Leu Leu He Cys Gin Ser 

95 100 105 

Asn Lys Arg Trp Ser Asp Lys Val He Cys Arg Gin Lys Arg Cys 
110 115 120 

Pro Thr Leu Ala Met Pro Ala Asn Gly Gly Phe Lys Cys Val Asp 
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125 



130 



135 



Gly Ala Tyr Phe Asn Ser Arg Cys Glu Tyr Tyr Cys Ser Pro Gly 
140 145 150 

Tyr Thr Leu Lys Gly Glu Arg Thr Val Thr Cys Met Asp Asn Lys 
155 160 165 

Ala Trp Ser Gly Arg Pro Ala Ser Cys Val Asp Met Glu Pro Pro 
170 175 180 

Arg lie Lys Cys Pro Ser Val Lys Glu Arg lie Ala Glu Pro Asn 
185 190 195 

Lys Leu Thr Val Arg Val Ser Trp Glu Thr Pro Glu Gly Arg Asp 
200 205 210 

Thr Ala Asp Gly lie Leu Thr Asp Val lie Leu Lys Gly Leu Pro 
215 220 225 

Pro Gly Ser Asn Phe Pro Glu Gly Asp His Lys lie Gin Tyr Thr 
230 235 240 

Val Tyr Asp Arg Ala Glu Asn Lys Gly Thr Cys Lys Phe Arg Val 
245 250 255 

Lys Val Arg Val Lys Arg Cys Gly Lys Leu Asn Ala Pro Glu Asn 
260 265 270 

Gly Tyr Met Lys Cys Ser Ser Asp Gly Asp Asn Tyr Gly Ala Thr 
275 280 285 

Cys Glu Phe Ser Cys lie Gly Gly Tyr Glu Leu Gin Gly Ser Pro 
290 295 300 

Ala Arg Val Cys Gin Ser Asn Leu Ala Trp Ser Gly Thr Glu Pro 
305 310 315 

Thr Cys Ala Ala Met Asn Val Asn Val Gly Val Arg Thr Ala Ala 
320 325 330 

Ala Leu Leu Asp Gin Phe Tyr Glu Lys Arg Arg Leu Leu lie Val 
335 340 345 

Ser Thr Pro Thr Ala Arg Asn Leu Leu Tyr Arg Leu Gin Leu Gly 
350 355 360 

Met Leu Gin Gin Ala Gin Cys Gly Leu Asp Leu Arg His lie Thr 
365 370 375 

Val Val Glu Leu Val Gly Val Phe Pro Thr Leu lie Gly Arg He 
380 385 390 

Gly Ala Lys He Met Pro Pro Ala Leu Ala Leu Gin Leu Ser Phe 
395 400 405 

He Thr Leu Thr Lys Leu Glu Pro Trp He Leu Val Ser Glu 



<210> 8 
<211> 1109 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7509063CD1 

<400> 8 

Met Arg Cys Lys Glu Leu Glu Asn Ala Val Gly Ser Trp Thr Asp 
15 10 15 

Asp Leu Thr Gin Leu Ser Leu Leu Lys Asp Thr Leu Ser Ala Tyr 
20 25 30 

He Ser Ala Asp Asp He Ser He Leu Asn Glu Arg Val Glu Leu 



410 



415 



35 



40 



45 
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Leu Gin Arg Gin Trp Glu Glu Leu Cys His Gin Leu Ser Leu Arg 
50 55 60 

Arg Gin Gin lie Gly Glu Arg Leu Asn Glu Trp Ala Val Phe Ser 
65 70 75 

Glu Lys Asn Lys Glu Leu Cys Glu Trp Leu Thr Gin Met Glu Ser 
80 85 90 

Lys Val Ser Gin Asn Gly Asp He Leu He Glu Glu Met He Glu 
95 100 105 

Lys Leu Lys Lys Asp Tyr Gin Glu Glu He Ala He Ala Gin Glu 
110 115 120 

Asn Lys He Gin Leu Gin Gin Met Gly Glu Arg Leu Ala Lys Ala 
125 ' 130 .135 

Ser His Glu Ser Lys Ala Ser Glu He Glu Tyr Lys Leu Gly Lys 
140 145 150 

Val Asn Asp Arg Trp Gin His Leu Leu Asp Leu He Ala Ala Arg 
155 160 165 

Val Lys Lys Leu Lys Glu Thr Leu Val Ala Val Gin Gin Leu Asp 
170 175 180 

Lys Asn Met Ser Ser Leu Arg Thr Trp Leu Ala His He Glu Ser 
185 190 195 

Glu Leu Ala Lys Pro He Val Tyr Asp Ser Cys Asn Ser Glu Glu 
200 205 210 

He Gin Arg Lys Leu Asn Glu Gin Gin Glu Leu Gin Arg Asp He 
215 220 225 

Glu Lys His Ser Thr Gly Val Ala Ser Val Leu Asn Leu Cys Glu 
230 235 240 

Val Leu Leu His Asp Cys Asp Ala Cys Ala Thr Asp Ala Glu Cys 
245 250 255 

Asp Ser He Gin Gin Ala Thr Arg Asn Leu Asp Arg Arg Trp Arg 
260 265 270 

Asn He Cys Ala Met Ser Met Glu Arg Arg Leu Lys He Glu Glu 
275 280 285 

Thr Trp Arg Leu Trp Gin Lys Phe Leu Asp Asp Tyr Ser Arg Phe 
290 295 300 

Glu Asp Trp Leu Lys Ser Ser Glu Arg Thr Ala Ala Phe Pro Ser 
305 310 315 

Ser Ser Gly Val He Tyr Thr Val Ala Lys Glu Glu Leu Lys Lys 
320 325 330 

Phe Glu Ala Phe Gin Arg Gin Val His Glu Cys Leu Thr Gin Leu 
335 340 345 

Glu Leu He Asn Lys Gin Tyr Arg Arg Leu Ala Arg Glu Asn Arg 
350 355 360 

Thr Asp Ser Ala Cys Ser Leu Lys Gin Met Val His Glu Gly Asn 
365 370 375 

Gin Arg Trp Asp Asn Leu Gin Lys Arg Val Thr Ser He Leu Arg 
380 385 390 

Arg Leu Lys His Phe He Gly Gin Arg Glu Glu Phe Glu Thr Ala 
395 400 405 

Arg Asp Ser He Leu Val Trp Leu Thr Glu Met Asp Leu Gin Leu 
410 415 420 

Thr Asn He Glu His Phe Ser Glu Cys Asp Val Gin Ala Lys He 
425 430 435 

Lys Gin Leu Lys Ala Phe Gin Gin Glu He Ser Leu Asn His Asn 
440 445 450 

Lys He Glu Gin He He Ala Gin Gly Glu Gin Leu He Glu Lys 
455 460 465 
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Ser Glu Pro Leu Asp Ala Ala He He Glu Glu Glu Leu Asp Glu 
470 475 480 

Leu Arg Arg Tyr Cys Gin Glu Val Phe Gly Arg Val Glu Arg Tyr 
485 490 495 

His Lys Lys Leu He Arg Leu Pro Leu Pro Asp Asp Glu His Asp 
500 505 510 

Leu Ser Asp Arg Glu Leu Glu Leu Glu Asp Ser Ala Ala Leu Ser 
515 520 525 

Asp Leu His Trp His Asp Arg Ser Ala Asp Ser Leu Leu Ser Pro 

530 535 540 

Gin Pro Ser Ser Asn Leu Ser Leu Ser Leu Ala Gin Pro Leu Arg 

545 550 555 

Ser Glu Arg Ser Gly Arg Asp Thr Pro Ala Ser Val Asp Ser He 

560 565 570 

Pro Leu Glu Trp Asp His Asp Tyr Asp Leu Ser Arg Asp Leu Glu 

575 580 585 

Ser Ala Met Ser Arg Ala Leu Pro Ser Glu Asp Glu Glu Gly Gin 

590 595 600 

Asp Asp Lys Asp Phe Tyr Leu Arg Gly Ala Val Ala Leu Ser Asp 

605 610 615 

Val Met He Pro Glu Ser Pro Glu Ala Tyr Val Lys Leu Thr Glu 

620 625 630 

Asn Ala He Lys Asn Thr Ser Gly Asp His Ser Ala Leu Glu Ser 

635 640 645 

Gin He Arg Gin Leu Gly Lys Ala Leu Asp Asp Ser Arg Phe Gin 

650 655 660 

He Gin Gin Thr Glu Asn He He Arg Ser Lys Thr Pro Thr Gly 

665 670 675 

Pro Glu Leu Asp Thr Ser Tyr Lys Gly Tyr Met Lys Leu Leu Gly 

680 685 690 

Glu Cys Ser Ser Ser He Asp Ser Val Lys Arg Leu Glu His Lys 

695 700 705 

Leu Lys Glu Glu Glu Glu Ser Leu Pro Gly Phe Val Asn Leu His 

710 715 720 

Ser Thr Glu Thr Gin Thr Ala Gly Val He Asp Arg Trp Glu Leu 

725 730 735 

Leu Gin Ala Gin Ala Leu Ser Lys Glu Leu Arg Met Lys Gin Asn 

740 745 750 

Leu Gin Lys Trp Gin Gin Phe Asn Ser Asp Leu Asn Ser He Trp 

755 760 765 

Ala Trp Leu Gly Asp Thr Glu Glu Glu Leu Glu Gin Leu Gin Arg 

770 775 780 

Leu Glu Leu Ser Thr Asp He Gin Thr He Glu Leu Gin He Lys 

785 790 795 

Lys Leu Lys Glu Leu Gin Lys Ala Val Asp His Arg Lys Ala He 

800 805 810 

He Leu Ser He Asn Leu Cys Ser Pro Glu Phe Thr Gin Ala Asp 

815 820 825 

Ser Lys Glu Ser Arg Asp Leu Gin Asp Arg Leu Ser Gin Met Asn 

830 835 840 

Gly Arg Trp Asp Arg Val Cys Ser Leu Leu Glu Glu Trp Arg Gly 

845 850 855 

Leu Leu Gin Asp Ala Leu Met Gin Cys Gin Gly Phe His Glu Met 

860 865 870 

Ser His Gly Leu Leu Leu Met Leu Glu Asn He Asp Arg Arg Lys 

875 880 885 
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Asn Glu lie Val Pro He Asp Ser Asn Leu Asp Ala Glu He Leu 
890 895 900 

Gin Asp His His Lys Gin Leu Met Gin He Lys His Glu Leu Leu 
905 910 915 

Glu Ser Gin Leu Arg Val Ala Ser Leu Gin Asp Met Ser Cys Gin 
920 925 930 

Leu Leu Val Asn Ala Glu Gly Thr Asp Cys Leu Glu Ala Lys Glu 
935 940 945 

Lys Val His Val He Gly Asn Arg Leu Lys Leu Leu Leu Lys Glu 
950 955 960 

Val Ser Arg His He Lys Glu Leu Glu Lys Leu Leu Asp Val Ser 
965 970 975 

Ser Ser Gin Gin Asp Leu Ser Ser Trp Ser Ser Ala Asp Glu Leu 
980 985 990 

Asp Thr Ser Gly Ser Val Ser Pro Thr Ser Gly Arg Ser Thr Pro 
995 1000 1005 

Asn Arg Gin Lys Thr Pro Arg Gly Lys Cys Ser Leu Ser Gin Pro 
1010 1015 1020 

Gly Pro Ser Val Ser Ser Pro His Ser Arg Ser Thr Lys Gly Gly 
1025 1030 1035 

Ser Asp Ser Ser Leu Ser Glu Pro Gly Pro Gly Arg Ser Gly Arg 
1040 1045 1050 

Gly Phe Leu Phe Arg Val Leu Arg Ala Ala Leu Pro Leu Gin Leu 
1055 1060 1065 

Leu Leu Leu Leu Leu He Gly Leu Ala Cys Leu Val Pro Met Ser 
1070 1075 1080 

Glu Glu Asp Tyr Ser Cys Ala Leu Ser Asn Asn Phe Ala Arg Ser 
1085 1090 1095 

Phe His Pro Met Leu Arg Tyr Thr Asn Gly Pro Pro Pro Leu 
1100 1105 

<210> 9 
<211> 1308 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc__feature 

<223> Incyte ID Wo: 5496710CD1 

<400> 9 

Met Gly Ser Val Thr Gly Ala Val Leu Lys Thr Leu Leu Leu Leu 

15 10 15 

Ser Thr Gin Asn Trp Asn Arg Val Glu Ala Gly Asn Ser Tyr Asp 

20 25 30 

Cys Asp Asp Pro Leu Val Ser Ala Leu Pro Gin Ala Ser Phe Ser 

35 40 45 

Ser Ser Ser Glu Leu Ser Ser Ser His Gly Pro Gly Phe Ala Arg 

50 55 60 

Leu Asn Arg Arg Asp Gly Ala Gly Gly Trp Ser Pro Leu Val Ser 

65 70 75 

Asn Lys Tyr Gin Trp Leu Gin He Asp Leu Gly Glu Arg Met Glu 

80 85 90 

Val Thr Ala Val Ala Thr Gin Gly Gly Tyr Gly Ser Ser Asn Trp 

95 100 105 
Val Thr Ser Tyr Leu Leu Met Phe Ser Asp Ser Gly Trp Asn Trp 
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110 115 120 

Lys Gin Tyr Arg Gin Glu Asp Ser lie Trp Gly Phe Ser Gly Asn 
125 130 135 

Ala Asn Ala Asp Ser Val Val Tyr Tyr Arg Leu Gin Pro Ser lie 
140 145 150 

Lys Ala Arg Phe Leu Arg Phe lie Pro Leu Glu Trp Asn Pro Lys 
155 160 165 

Gly Arg lie Gly Met Arg He Glu Val Phe Gly Cys Ala Tyr Arg 
170 175 180 

Ser Glu Val Val Asp Leu Asp Gly Lys Ser Ser Leu Leu Tyr Arg 
185 190 195 

Phe Asp Gin Lys Ser Leu Ser Pro He Lys Asp He He Ser Leu 
200 205 210 

Lys Phe Lys Thr Met Gin Ser Asp Gly He Leu Leu His Arg Glu 
215 220 225 

Gly Pro Asn Gly Asp His He Thr Leu Gin Leu Arg Arg Ala Arg 
230 235 240 

Leu Phe Leu Leu He Asn Ser Gly Glu Ala Lys Leu Pro Ser Thr 
245 250 255 

Ser Thr Leu Val Asn Leu Thr Leu Gly Ser Leu Leu Asp Asp Gin 
260 265 270 

His Trp His Ser Val Leu He Gin Arg Leu Gly Lys Gin Val Asn 
275 280 285 

Phe Thr Val Asp Glu His Arg His His Phe His Ala Arg Gly Glu 
290 295 300 

Phe Asn Leu Met Asn Leu Asp Tyr Glu He Ser Phe Gly Gly He 
305 310 315 

Pro Ala Pro Gly Lys Ser Val Ser Phe Pro His Arg Asn Phe His 
320 325 330 

Gly Cys Leu Glu Asn Leu Tyr Tyr Asn Gly Val Asp He He Asp 
335 340 345 

Leu Ala Lys Gin Gin Lys Pro Gin He He Ala Met Gly Asn Val 
350 355 360 

Ser Phe Ser Cys Ser Gin Pro Gin Ser Met Pro Val Thr Phe Leu 
365 370 375 

Ser Ser Arg Ser Tyr Leu Ala Leu Pro Asp Phe Ser Gly Glu Glu 
380 385 390 

Glu Val Ser Ala Thr Phe Gin Phe Arg Thr Trp Asn Lys Ala Gly 
395 400 405 

Leu Leu Leu Phe Ser Glu Leu Gin Leu He Ser Gly Gly He Leu 
410 415 420 

Leu Phe Leu Ser Asp Gly Lys Leu Lys Ser Asn Leu Tyr Gin Pro 
425 430 435 

Gly Lys Leu Pro Ser Asp He Thr Ala Gly Val Glu Leu Asn Asp 
440 445 450 

Gly Gin Trp His Ser Val Ser Leu Ser Ala Lys Lys Asn His Leu 
455 460 465 

Ser Val Ala Val Asp Gly Gin Met Ala Ser Ala Ala Pro Leu Leu 
470 475 480 

Gly Pro Glu Gin He Tyr Ser Gly Gly Thr Tyr Tyr Phe Gly Gly 
485 490 495 

Cys Pro Asp Lys Ser Phe Gly Ser Lys Cys Lys Ser Pro Leu Gly 
500 505 510 

Gly Phe Gin Gly Cys Met Arg Leu He Ser He Ser Gly Lys Val 
515 520 525 

Val Asp Leu He Ser Val Gin Gin Gly Ser Leu Gly Asn Phe Ser 
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530 535 540 

Asp Leu Gin lie Asp Ser Cys Gly lie Ser Asp Arg Cys Leu Pro 
545 550 555 

Asn Tyr Cys Glu His Gly Gly Glu Cys Ser Gin Ser Trp Ser Thr 
560 565 570 

Phe His Cys Asn Cys Thr Asn Thr Gly Tyr Arg Gly Ala Thr Cys 
575 580 585 

His Asn Ser He Tyr Glu Gin Ser Cys Glu Ala Tyr Lys His Arg 
590 595 600 

Gly Asn Thr Ser Gly Phe Tyr Tyr He Asp Ser Asp Gly Ser Gly 
605 610 615 

Pro Leu Glu Pro Phe Leu Leu Tyr Cys Asn Met Thr Glu Thr Ala 
620 625 * 630 

Trp Thr He He Gin His Asn Gly Ser Asp Leu Thr Arg Val Arg 
635 640 645 

Asn Thr Asn Pro Glu Asn Pro Tyr Ala Gly Phe Phe Glu Tyr Val 
650 655 660 

Ala Ser Met Glu Gin Leu Gin Ala Thr He Asn Arg Ala Glu His 
665 670 675 

Cys Glu Gin Glu Phe Thr Tyr Tyr Cys Lys Lys Ser Arg Leu Val 
680 685 690 

Asn Lys Gin Asp Gly Thr Pro Leu Ser Trp Trp Val Gly Arg Thr 
695 700 705 

Asn Glu Thr Gin Thr Tyr Trp Gly Gly Ser Ser Pro Asp Leu Gin 
710 715 720 

Lys Cys Thr Cys Gly Leu Glu Gly Asn Cys He Asp Ser Gin Tyr 
725 730 735 

Tyr Cys Asn Cys Asp Ala Asp Arg Asn Glu Trp Thr Asn Asp Thr 
740 745 750 

Gly Leu Leu Ala Tyr Lys Glu His Leu Pro Val Thr Lys He Val 
755 760' 765 

He Thr Asp Thr Gly Arg Leu His Ser Glu Ala Ala Tyr Lys Leu 
770 775 780 

Gly Pro Leu Leu Cys Arg Gly Asp Arg Ser Phe Trp Asn Ser Ala 
785 790 795 

Ser Phe Asp Thr Glu Ala Ser Tyr Leu His Phe Pro Thr Phe His 
800 805 810 

Gly Glu Leu Ser Ala Asp Val Ser Phe Phe Phe Lys Thr Thr Ala 
815 820 825 

Ser Ser Gly Val Phe Leu Glu Asn Leu Gly He Ala Asp Phe He 
830 835 840 

Arg He Glu Leu Arg Ser Pro Thr Val Val Thr Phe Ser Phe Asp 
845 850 855 

Val Gly Asn Gly Pro Phe Glu He Ser Val Gin Ser Pro Thr His 
860 865 870 

Phe Asn Asp Asn Gin Trp His His Val Arg Val Glu Arg Asn Met 
875 880 885 

Lys Glu Ala Ser Leu Gin Val Asp Gin Leu Thr Pro Lys Thr Gin 
890 895 900 

Pro Ala Pro Ala Asp Gly His Val Leu Leu Gin Leu Asn Ser Gin 
905 910 915 

Leu Phe Val Gly Gly Thr Ala Thr Arg Gin Arg Gly Phe Leu Gly 
920 925 930 

Cys He Arg Ser Leu Gin Leu Asn Gly Met Thr Leu Asp Leu Glu 
935 940 945 

Glu Arg Ala Gin Val Thr Pro Glu Val Gin Pro Gly Cys Arg Gly 
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950 955 960 

His Cys Ser Ser Tyr Gly Lys Leu Cys Arg Asn Gly Gly Lys Cys 
965 970 975 

Arg Glu Arg Pro lie Gly Phe Phe Cys Asp Cys Thr Phe Ser Ala 
980 985 990 

Tyr Thr Gly Pro Phe Cys Ser Asn Glu lie Ser Ala Tyr Phe Gly 
995 1000 1005 

Ser Gly Ser Ser Val lie Tyr Asn Phe Gin Glu Asn Tyr Leu Leu 
1010 1015 1020 

Ser Lys Asn Ser Ser Ser His Ala Ala Ser Phe His Gly Asp Met 
1025 1030 1035 

Lys Leu Ser Arg Glu Met lie Lys Phe Ser Phe Arg Thr Thr Arg 
1040 1045 1050 

Thr Pro Ser Leu Leu Leu Phe Val Ser Ser Phe Tyr Lys Glu Tyr 
• 1055 1060 1065 

Leu Ser Val lie lie Ala Lys Asn Gly Ser Leu Gin lie Arg Tyr 
1070 1075 1080 

Lys Leu Asn Lys Tyr Gin Glu Pro Asp Val Val Asn Phe Asp Phe 
1085 1090 1095 

Lys Asn Met Ala Asp Gly Gin Leu His His lie Met lie Asn Arg 
1100 1105 1110 

Glu Glu Gly Val Val Phe He Glu He Asp Asp Asn Arg Arg Arg 
1115 1120 1125 

Gin Val His Leu Ser Ser Gly Thr Glu Phe Ser Ala Val Lys Ser 
1130 1135 1140 

Leu Val Leu Gly Arg He Leu Glu His Ser Asp Val Asp Gin Asp 
1145 1150 1155 

Thr Ala Leu Ala Gly Ala Gin Gly Phe Thr Gly Cys Leu Ser Ala 
1160 1165 1170 

Val Gin Leu Ser His Val Ala Pro Leu Lys Ala Ala Leu His Pro 
1175 1180 1185 

Ser His Pro Asp Pro Val Thr Val Thr Gly His Val Thr Glu Ser 
1190 1195 1200 

Ser Cys Met Ala Gin Pro Gly Thr Asp Ala Thr Ser Arg Glu Arg 
1205 1210 1215 

Thr His Ser Phe Ala Asp His Ser Gly Thr He Asp Asp Arg Glu 
1220 1225 1230 

Pro Leu Ala Asn Ala He Lys Ser Asp Ser Ala Val He Gly Gly 
1235 1240 1245 

Leu He Ala Val Val He Phe He Leu Leu Cys He Thr Ala He 
1250 1255 1260 

Ala Val Arg He Tyr Gin Gin Lys Arg Leu Tyr Lys Arg Ser Glu 
1265 1270 1275 

Ala Lys Arg Ser Glu Asn Val Asp Ser Ala Glu Ala Val Leu Lys 
1280 1285 1290 

Ser Glu Leu Asn He Gin Asn Ala Val Asn Glu Asn Gin Lys Glu 
1295 1300 1305 

Tyr Phe Phe 



<210> 10 

<211> 1009 

<212> PRT 

<213> Homo sapiens 

<220> 
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<221> misc_feature 

<223> Incyte ID No: 72150826CD1 

<400> 10 

Met Asp Leu lie Arg Gly Val Leu Leu Arg Leu Leu Leu Leu Ala 
1 5 10 15 

Ser Ser Leu Gly Pro Gly Ala Val Ser Leu Arg Ala Ala He Arg 
20 25 30 

Lys Pro Gly Lys Val Gly Pro Pro Leu Asp He Lys Leu Gly Ala 
35 40 45 

Leu Asn Cys Thr Ala Phe Ser He Gin Trp Lys Met Pro Arg His 
50 55 60 

Pro Gly Ser Pro He Leu Gly Tyr Thr Val Phe Tyr Ser Glu Val 
65 70 75 

Gly Ala Asp Lys Ser Leu Gin Glu Gin Leu His Ser Val Pro Leu 
80 85 90 

Ser Arg Asp He Pro Thr Thr Glu Glu Val He Gly Asp Leu Lys 
95 100 105 

Pro Gly Thr Glu Tyr His Val Ser He Ala Ala Tyr Ser Gin Ala 
110 115 120 

Gly Lys Gly Arg Leu Ser Ser Pro Arg His Val Thr Thr Leu Ser 
125 130 135 

Gin Asp Ser Cys Leu Pro Pro Ala Ala Pro Gin Gin Pro His Val 
140 145 150 

He Val Val Ser Asp Ser Glu Val Ala Leu Ser Trp Lys Pro Gly 
155 160 165 

Ala Ser Glu Gly Ser Ala Pro He Gin Tyr Tyr Ser Val Glu Phe 
170 175 180 

He Arg Pro Asp Phe Asp Lys Lys Trp Thr Ser He His Glu Arg 
185 190 195 

He Gin Met Asp Ser Met Val He Lys Gly Leu Asp Pro Asp Thr 
200 205 210 

Asn Tyr Gin Phe Ala Val Arg Ala Met Asn Ser His Gly Pro Ser 
215 220 225 

Pro Arg Ser Trp Pro Ser Asp He He Arg Thr Leu Cys Pro Glu 
230 235 240 

Glu Ala Gly Ser Gly Arg Tyr Gly Pro Arg Tyr He Thr Asp Met 
245 250 255 

Gly Ala Gly Glu Asp Asp Glu Gly Phe Glu Asp Asp Leu Asp Leu 
260 265 270 

Asp He Ser Phe Glu Glu Val Lys Pro Leu Pro Ala Thr Lys Gly 
275 280 285 

Gly Asn Lys Lys Phe Leu Val Glu Ser Lys Lys Met Ser He Ser 
290 295 300 

Asn Pro Lys Thr He Ser Arg Leu He Pro Pro Thr Ser Ala Ser 
305 310 315 

Leu Pro Val Thr Thr Val Ala Pro Gin Pro He Pro He Gin Arg 
320 325 330 

Lys Gly Lys Asn Gly Val Ala He Met Ser Arg Leu Phe Asp Met 
335 340 345 

Pro Cys Asp Glu Thr Leu Cys Ser Ala Asp Ser Phe Cys Val Asn 
350 355 360 

Asp Tyr Thr Trp Gly Gly Ser Arg Cys Gin Cys Thr Leu Gly Lys 
365 370 375 

Gly Gly Glu Ser Cys Ser Glu Asp He Val He Gin Tyr Pro Gin 
380 385 390 
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Phe Phe Gly His Ser Tyr Val Thr Phe Glu Pro Leu Lys Asn Ser 
395 400 405 

Tyr Gin Ala Phe Gin lie Thr Leu Glu Phe Arg Ala Glu Ala Glu 
410 415 420 

Asp Gly Leu Leu Leu Tyr Cys Gly Glu Asn Glu His Gly Arg Gly 
425 430 435 

Asp Phe Met Ser Leu Ala lie lie Arg Arg Ser Leu Gin Phe Arg 
440 445 450 

Phe Asn Cys Gly Thr Gly Val Ala lie He Val Ser Glu Thr Lys 
455 460 465 

He Lys Leu Gly Gly Trp His Thr Val Met Leu Tyr Arg Asp Gly 
470 475 480 

Leu Asn Gly Leu Leu Gin Leu Asn Asn Gly Thr Pro Val Thr Gly 
485 490 495 

Gin Ser Gin Gly Gin Tyr Ser Lys He Thr Phe Arg Thr Pro Leu 
500 505 510 

Tyr Leu Gly Gly Ala Pro Ser Ala Tyr Trp Leu Val Arg Ala Thr 
515 520 525 

Gly Thr Asn Arg Gly Phe Gin Gly Cys Val Gin Ser Leu Ala Val 
530 535 540 

Asn Gly Arg Arg He Asp Met Arg Pro Trp Pro Leu Gly Lys Ala 
545 550 555 

Leu Ser Gly Ala Asp Val Gly Glu Cys Ser Ser Gly He Cys Asp 
560 565 570 

Glu Ala Ser Cys He His Gly Gly Thr Cys Thr Ala He Lys Ala 
575 580 585 

Asp Ser Tyr He Cys Leu Cys Pro Leu Gly Phe Lys Gly Arg His 
590 595 600 

Cys Glu Asp Ala Phe Thr Leu Thr He Pro Gin Phe Arg Glu Ser 
605 610 615 

Leu Arg Ser Tyr Ala Ala Thr Pro Trp Pro Leu Glu Pro Gin His 
620 625 630 

Tyr Leu Ser Phe Met Glu Phe Glu He Thr Phe Arg Pro Asp Ser 
635 640 645 

Gly Asp Gly Val Leu Leu Tyr Ser Tyr Asp Thr Gly Ser Lys Asp 
650 655 660 

Phe Leu Ser He Asn Leu Ala Gly Gly His Val Glu Phe Arg Phe 
665 670 675 

Asp Cys Gly Ser Gly Thr Gly Val Leu Arg Ser Glu Asp Pro Leu 
680 685 690 

Thr Leu Gly Asn Trp His Glu Leu Arg Val Ser Arg Thr Ala Lys 
695. 700 705 

Asn Gly He Leu Gin Val Asp Lys Gin Lys He Val Glu Gly Met 
710 715 720 

Ala Glu Gly Gly Phe Thr Gin He Lys Cys Asn Thr Asp He Phe 
725 730 735 

He Gly Gly Val Pro Asn Tyr Asp Asp Val Lys Lys Asn Ser Gly 
740 745 750 

Val Leu Lys Pro Phe Ser Gly Ser He Gin Lys He He Leu Asn 
755 760 765 

Asp Arg Thr He His Val Lys His Asp Phe Thr Ser Gly Val Asn 
770 775 780 

Val Glu Asn Ala Ala His Pro Cys Val Arg Ala Pro Cys Ala His 
785 790 795 

Gly Gly Ser Cys Arg Pro Arg Lys Glu Gly Tyr Asp Cys Asp Cys 
800 805 810 
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Pro Leu Gly Phe Glu Gly Leu His Cys Gin Lys Ala lie lie Glu 

815 820 825 

Ala He Glu He Pro Gin Phe He Gly Arg Ser Tyr Leu Thr Tyr 

830 835 840 

Asp Asn Pro Asp lie Leu Lys Arg Val Ser Gly Ser Arg Ser Asn 

845 850 855 

Val Phe Met Arg Phe Lys Thr Thr Ala Lys Asp Gly Leu Leu Leu 

860 865 870 

Trp Arg Gly Asp Ser Pro Met Arg Pro Asn Ser Asp Phe He Ser 

875 880 885 

Leu Gly Leu Arg Asp Gly Ala Leu Val Phe Ser Tyr Asn Leu Gly 

890 895 900 

Ser Gly Val Ala Ser He Met Val Asn Gly Ser Phe Asn Asp Gly 

905 910 915 

Arg Trp His Arg Val Lys Ala Val Arg Asp Gly Gin Ser Gly Lys 

920 925 930 

He Thr Val Asp Asp Tyr Gly Ala Arg Thr Gly Lys Ser Pro Gly 

935 940 945 

Met Met Arg Gin Leu Asn He Asn Gly Ala Leu Tyr Val Gly Gly 

950 955 960 

Met Lys Glu He Ala Leu His Thr Asn Arg Gin Tyr Met Arg Gly 

965 970 975 

Leu Val Gly Cys He Ser His Phe Thr Leu Ser Thr Asp Tyr His 

980 985 990 

He Ser Leu Val Glu Asp Ala Val Asp Gly Lys Asn He Asn Thr 

995 1000 1005 

Cys Gly Ala Lys 



<210> 11 

<211> 962 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> miec_f eature 

<223> Incyte ID No: 6799476CD1 



<400> 11 



Met Leu Arg Arg 


Pro 


Ala 


Pro Ala 


Leu 


Ala Pro Ala Ala Arg 


Leu 


1 


5 








10 


15 


Leu Leu Ala Gly Leu Leu 


Cys Gly Gly 


Gly Val Trp Ala Ala 


Arg 




20 








25 


30 


Val Asn Lys His 


Lys 


Pro 


Trp Leu 


Glu 


Pro Thr Tyr His Gly 


He 




35 








40 


45 


Val Thr Glu Asn 


Asp 


Asn 


Thr Val 


Leu 


Leu Asp Pro Pro Leu 


He 




50 








55 


60 


Ala Leu Asp Lys 


Asp 


Ala 


Pro Leu Arg 


Phe Ala Glu Ser Phe 


Glu 




65 








70 


75 


Val Thr Val Thr 


Lys 


Glu 


Gly Glu 


He 


Cys Gly Phe Lys He 


His 




80 








85 


90 


Gly Gin Asn Val 


Pro 


Phe 


Asp Ala 


Val 


Val Val Asp Lys Ser 


Thr 


95 








100 


105 


Gly Glu Gly Val 


He 


Arg 


Ser Lys 


Glu 


Lys Leu Asp Cys Glu 


Leu 


110 








115 


120 


Gin Lys Asp Tyr 


Ser 


Phe 


Thr He 


Gin 


Ala Tyr Asp Cys Gly 


Lys 
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125 130 135 

Gly Pro Asp Gly Thr Asn Val Lys Lys Ser His Lys Ala Thr Val 
140 145 150 

His lie Gin Val Asn Asp Val Asn Glu Tyr Ala Pro Val Phe Lys 
155 160 165 

Glu Lys Ser Tyr Lys Ala Thr Val He Glu Gly Lys Gin Tyr Asp 
170 175 180 

Ser He Leu Arg Val Glu Ala Val Asp Ala Asp Cys Ser Pro Gin 
185 190 195 

Phe Ser Gin He Cys Ser Tyr Glu He He Thr Pro Asp Val Pro 
200 205 210 

Phe Thr Val Asp Lys Asp Gly Tyr He Lys Asn Thr Glu Lys Leu 
215 220 225 

Asn Tyr Gly Lys Glu His Gin Tyr Lys Leu Thr Val Thr Ala Tyr 
230 235 240 

Asp Cys Gly Lys Lys Arg Ala Thr Glu Asp Val Leu Val Lys He 
245 250 255 

Ser He Lys Pro Thr Cy S Thr Pro Gly Trp Gin Gly Trp Asn Asn 
260 265 270 

Arg He Glu Tyr Glu Pro Gly Thr Gly Ala Leu Ala Val Phe Pro 
275 280 285 

Asn He His Leu Glu Thr Cys Asp Glu Pro Val Ala Ser Val Gin 
290 295 300 

Ala Thr Val Glu Leu Glu Thr Ser His He Gly Lys Gly Cys Asp 
305 310 315 

Arg Asp Thr Tyr Ser Glu Lys Ser Leu His Arg Leu Cys Gly Ala 
320 325 330 

Ala Ala Gly Thr Ala Glu Leu Leu Pro Ser Pro Ser Gly Ser Leu 
335 340 345 

Asn Trp Thr Met Gly Leu Pro Thr Asp Asn Gly His Asp Ser Asp 
350 355 360 

Gin Val Phe Glu Phe Asn Gly Thr Gin Ala Val Arg He Pro Asp 
365 370 375 

Gly Val Val Ser Val Ser Pro Lys Glu Pro Phe Thr He Ser Val 
380 385 390 

Trp Met Arg His Gly Pro Phe Gly Arg Lys Lys Glu Thr He Leu 
395 400 405 

Cys Ser Ser Asp Lys Thr Asp Met Asn Arg His His Tyr Ser Leu 
410 415 420 

Tyr Val His Gly Cys Arg Leu He Phe Leu Phe Arg Gin Asp Pro 
425 430 435 

Ser Glu Glu Lys Lys Tyr Arg Pro Ala Glu Phe His Trp Lys Leu 
440 445 450 

Asn Gin Val Cys Asp Glu Glu Trp His His Tyr Val Leu Asn Val 
455 460 465 

Glu Phe Pro Ser Val Thr Leu Tyr Val Asp Gly Thr Ser His Glu 
470 475 480 

Pro Phe Ser Val Thr Glu Asp Tyr Pro Leu His Pro Ser Lys He 
485 490 495 

Glu Thr Gin Leu Val Val Gly Ala Cys Trp Gin Gly Gly Asp Leu 
500 505 510 

His Met Thr Gin Phe Phe Arg Gly Asn Leu Ala Gly Leu Thr Leu 
515 520 525 

Arg Ser Gly Lys Leu Ala Asp Lys Lys Val He Asp Cys Leu Tyr 
530 535 540 

Thr Cys Lys Glu Gly Leu Asp Leu Gin Val Leu Glu Asp Ser Gly 



25/65 



WO 03/051902 



PCT/US02/40059 



545 550 555 

Arg Qly Val Gin lie Gin Ala His Pro Ser Gin Leu Val Leu Thr 
560 565 570 

Leu Glu Gly Glu Asp Leu Gly Glu Leu Asp Lys Ala Met Gin His 
575 580 585 

lie Ser Tyr Leu Asn Ser Arg Gin Phe Pro Thr Pro Gly He Arg 
590 595 * 600 

Arg Leu Lys He Thr Ser Thr He Lys Cys Phe Asn Glu Ala Thr 
605 610 615 

Cys He Ser Val Pro Pro Val Asp Gly Tyr Val Met Val Leu Gin 
620 625 630 

Pro Glu Glu Pro Lys He Ser Leu Ser Gly Val His His Phe Ala 
635 640 645 

Arg Ala Ala Ser Glu Phe Glu Ser Ser Glu Gly Val Phe Leu Phe 
650 655 660 

Pro Glu Leu Arg He He Ser Thr He Thr Arg Glu Val Glu Pro 
665 670 675 

Glu Gly Asp Gly Ala Glu Asp Pro Thr Val Gin Glu Ser Leu Val 
680 685 690 

Ser Glu Glu He Val His Asp Leu Asp Thr Cys Glu Val Thr Val 
695 700 705 

Glu Gly Glu Glu Leu Asn His Glu Gin Glu Ser Leu Glu Val Asp 
710 715 720 

Met Ala Arg Leu Gin Gin Lys Gly He Glu Val Ser Ser Ser Glu 
725 730 735 

Leu Gly Met Thr Phe Thr Gly Val Asp Thr Met Ala Ser Tyr Glu 
740 745 750 

Glu Val Leu His Leu Leu Arg Tyr Arg Asn Trp His Ala Arg Ser 
755 760 765 

Leu Leu Asp Arg Lys Phe Lys Leu He Cys Ser Glu Leu Asn Gly 
770 775 780 

Arg Tyr He Ser Asn Glu Phe Lys Val Glu Val Asn Val He His 
785 790 795 

Thr Ala Asn Pro Met Glu His Ala Asn His Met Ala Ala Gin Pro 
800 805 810 

Gin Phe Val His Pro Glu His Arg Ser Phe Val Asp Leu Ser Gly 
815 820 825 

His Asn Leu Ala Asn Pro His Pro Phe Ala Val Val Pro Ser Thr 
830 835 840 

Ala Thr Val Val He Val Val Cys Val Ser Phe Leu Val Phe Met 
845 850 855 

He He Leu Gly Val Phe Arg He Arg Ala Ala His Arg Arg Thr 
860 865 870 

Met Arg Asp Gin Asp Thr Gly Lys Glu Asn Glu Met Asp Trp Asp 
875 880 885 

Asp Ser Ala Leu Thr He Thr Val Asn Pro Met Glu Thr Tyr Glu 
890 895 900 

Asp Gin His Ser Ser Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu 
905 910 915 

Glu Ser Glu Asp Gly Glu Glu Glu Asp Asp He Thr Ser Ala Glu 
920 925 930 

Ser Glu Ser Ser Glu Val Glu Glu Gly Glu Gin Gly Asp Pro Gin 
935 940 945 

Asn Ala Thr Arg Gin Gin Gin Leu Glu Trp Asp Asp Ser Thr Leu 
950 955 960 

Ser Tyr 
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<210> 12 

<211> 953 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID Wo: 7509488CD1 

<400> 12 

Met Leu Arg Arg Pro Ala Pro Arg Ala Val Pro Pro Ala Ala Arg 
15 10 15 

Leu Leu Leu Ala Gly Leu Leu Cys Gly Gly Gly Val Trp Ala Ala 

20 25 30 

Arg Val Asn Lys His Lys Pro Trp Leu Glu Pro Thr Tyr His Gly 

35 40 45 

lie Val Thr Glu Asn Asp Asn Thr Val Leu Leu Asp Pro Pro Leu 

50 55 60 

lie Ala Leu Asp Lys Asp Ala Pro Leu Arg Phe Ala Gly Glu lie 

65 70 75 

Cys Gly Phe Lys lie His Gly Gin Asn Val Pro Phe Asp Ala Val 

80 85 90 

Val Val Asp Lys Ser Thr Gly Glu Gly Val He Arg Ser Lys Glu 

95 100 105 

Lys Leu Asp Cys Glu Leu Gin Lys Asp Tyr Ser Phe Thr He Gin 
110 115 120 

Ala Tyr Asp Cys Gly Lys Gly Pro Asp Gly Thr Asn Val Lys Lys 
125 130 135 

Ser His Lys Ala Thr Val His He Gin Val Asn Asp Val Asn Glu 
140 145 150 

Tyr Ala Pro Val Phe Lys Glu Lys Ser Tyr Lys Ala Thr Val He 

155 160 165 

Glu Gly Lys Gin Tyr Asp Ser He Leu Arg Val Glu Ala Val Asp 

170 175 180 

Ala Asp Cys Ser Pro Gin Phe Ser Gin He Cys Ser Tyr Glu He 

185 190 195 

He Thr Pro Asp Val Pro Phe Thr Val Asp Lys Asp Gly Tyr He 

200 205 210 

Lys Asn Thr Glu Lys Leu Asn Tyr Gly Lys Glu His Gin Tyr Lys 

215 220 225 

Leu Thr Val Thr Ala Tyr Asp Cys Gly Lys Lys Arg Ala Thr Glu 

230 235 240 

Asp Val Leu Val Lys He Ser He Lys Pro Thr Cys Thr Pro Gly 

245 250 255 

Trp Gin Gly Trp Asn Asn Arg He Glu Tyr Glu Pro Gly Thr Gly 

260 265 270 

Ala Leu Ala Val Phe Pro Asn He His Leu Glu Thr Cys Asp Glu 

275 280 285 

Pro Val Ala Ser Val Gin Ala Thr Val Glu Leu Glu Thr Ser His 

290 295 300 

He Gly Lys Gly Cys Asp Arg Asp Thr Tyr Ser Glu Lys Ser Leu 

305 310 315 

His Arg Leu Cys Gly Ala Ala Ala Gly Thr Ala Glu Leu Leu Pro 

320 325 330 
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Ser Pro Ser Gly" Ser Leu Asn Trp Thr Met Gly Leu Pro Thr Asp 
335 340 345 

Asn Gly His Asp Ser Asp Gin Val Phe Glu Phe Asn Gly Thr Gin 
350 355 360 

Ala Val Arg lie Pro Asp Gly Val Val Ser Val Ser Pro Lys Glu 
365 370 375 

Pro Phe Thr lie Ser Val Trp Met Arg His Gly Pro Phe Gly Arg 
380 385 390 

Lys Lys Glu Thr lie Leu Cys Ser Ser Asp Lys Thr Asp Met Asn 
395 400 405 

Arg His His Tyr Ser Leu Tyr Val His Gly Cys Arg Leu lie Phe 
410 415 420 

Leu Phe Arg Gin Asp Pro Ser Glu Glu Lys Lys Tyr Arg Pro Ala 
425 430 435 

Glu Phe His Trp Lys Leu Asn Gin Val Cys Asp Glu Glu Trp His 
440 445 450 

His Tyr Val Leu Asn Val Glu Phe Pro Ser Val Thr Leu Tyr Val 
455 460 465 

Asp Gly Thr Ser His Glu Pro Phe Ser Val Thr Glu Asp Tyr Pro 
470 475 480 

Leu His Pro Ser Lys lie Glu Thr Gin Leu Val Val Gly Ala Cys 
485 490 495 

Trp Gin Gly Gly Asp Leu His Met Thr Gin Phe Phe Arg Gly Asn 
500 505 510 

Leu Ala Gly Leu Thr Leu Arg Ser Gly Lys Leu Ala Asp Lys Lys 
515 520 525 

Val lie Asp Cys Leu Tyr Thr Cys Lys Glu Gly Leu Asp Leu Gin 
530 535 540 

Val Leu Glu Asp Ser Gly Arg Gly Val Gin He Gin Ala His Pro 
545 550 555 

Ser Gin Leu Val Leu Thr Leu Glu Gly Glu Asp Leu Gly Glu Leu 
560 565 570 

Asp Lys Ala Met Gin His He Ser Tyr Leu Asn Ser Arg Gin Phe 
575 580 585 

Pro Thr Pro Gly He Arg Arg Leu Lys He Thr Ser Thr He Lys 
590 595 600 

Cys Phe Asn Glu Ala Thr Cys He Ser Val Pro Pro Val Asp Gly 
605 610 615 

Tyr Val Met Val Leu Gin Pro Glu Glu Pro Lys He Ser Leu Ser 
i 620 625 630 

Gly Val His His Phe Ala Arg Ala Ala Ser Glu Phe Glu Ser Ser 
635 640 645 

Glu Gly Val Phe Leu Phe Pro Glu Leu Arg He He Ser Thr He 
650 655 660 

Thr Arg Glu Val Glu Pro Glu Gly Asp Gly Ala Glu Asp Pro Thr 
665 670 675 

Val Gin Glu Ser Leu Val Ser Glu Glu He Val His Asp Leu Asp 
680 685 690 

Thr Cys Glu Val Thr Val Glu Gly Glu Glu Leu Asn His Glu Gin 
695 700 705 

Glu Ser Leu Glu Val Asp Met Ala Arg Leu Gin Gin Lys Gly He 
710 715 720 

Glu Val Ser Ser Ser Glu Leu Gly Met Thr Phe Thr Gly Val Asp 
725 730 735 

Thr Met Ala Ser Tyr Glu Glu Val Leu His Leu Leu Arg Tyr Arg 
740 745 750 
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Asn Trp His Ala Arg Ser Leu Leu Asp Arg Lys Phe Lys Leu lie 
755 760 765 

Cys Ser Glu Leu Asn Gly Arg Tyr He Ser Asn Glu Phe Lys Val 
770 775 780 

Glu Val Asn Val He His Thr Ala Asn Pro Met Glu His Ala Asn 
785 790 795 

His Met Ala Ala Gin Pro Gin Phe Val His Pro Glu His Arg Ser 
800 805 810 

Phe Val Asp Leu Ser Gly His Asn Leu Ala Asn Pro His Pro Phe 
815 820 825 

Ala Val Val Pro Ser Thr Ala Thr Val Val He Val Val Cys Val 
830 835 840 

Ser Phe Leu Val Phe Met He He Leu Gly Val Phe Arg He Arg 
845 850 855 

Ala Ala His Arg Arg Thr Met Arg Asp Gin Asp Thr Gly Lys Glu 
860 865 870 

Asn Glu Met Asp Trp Asp Asp Ser Ala Leu Thr He Thr Val Asn 
875 880 885 

Pro Met Glu Thr Tyr Glu Asp Gin His Ser Ser Glu Glu Glu Glu 
890 895 900 

Glu Glu Glu Glu Glu Glu Glu Ser Glu Asp Gly Glu Glu Glu Asp 
905 910 915 

Asp He Thr Ser Ala Glu Ser Glu Ser Ser Glu Val Glu Glu Gly 
920 925 930 

Glu Gin Gly Asp Pro Gin Asn Ala Thr Arg Gin Gin Gin Leu Glu 
935 940 945 

Trp Asp Asp Ser Thr Leu Ser Tyr 
950 

<210> 13 

<211> 582 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID Wo: 7510060CD1 

<400> 13 

Met lie Phe Leu Thr Ala Leu Pro Leu Phe Trp He Met He Ser 
15 10 15 

Ala Ser Arg Gly Gly His Trp Gly Ala Trp Met Pro Ser Ser He 

20 25 30 

Ser Ala Phe Glu Gly Thr Cys Val Ser He Pro Cys Arg Phe Asp 

35 40 45 

Phe Pro Asp Glu Leu Arg Pro Ala Val Val His Gly Val Trp Tyr 

50 55 60 

Phe Asn Ser Pro Tyr Pro Lys Asn Tyr Pro Pro Val Val Phe Lys 

65 70 75 

Ser Arg Thr Gin Val Val His Glu Ser Phe Gin Gly Arg Ser Arg 

80 85 90 

Leu Leu Gly Asp Leu Gly Leu Arg Asn Cys Thr Leu Leu Leu Ser 

95 100 105 

Asn Val Ser Pro Glu Leu Gly Gly Lys Tyr Tyr Phe Arg Gly Asp 
110 115 120 

Leu Gly Gly Tyr Asn Gin Tyr Thr Phe Ser Glu His Ser Val Leu 
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125 130 135 

Asp lie Val Asn Thr Pro Asn He Val Val Pro Pro Glu Val Val 
140 145 150 

Ala Gly Thr Glu Val Glu Val Ser Cys Met Val Pro Asp Asn Cys 
155 160 165 

Pro Glu Leu Arg Pro Glu Leu Ser Trp Leu Gly His Glu Gly Leu 
170 175 180 

Gly Glu Pro Ala Val Leu Gly Arg Leu Arg Glu Asp Glu Gly Thr 
185 190 195 

Trp Val Gin Val Ser Leu Leu His Phe Val Pro Thr Arg Glu Ala 
200 205 210 

Asn Gly His Arg Leu Gly Cys Gin Ala Ser Phe Pro Asn Thr Thr 
215 220 225 

Leu Gin Phe Glu Gly Tyr Ala Ser Met Asp Val Lys Tyr Pro Pro 
230 235 240 

Val He Val Glu Met Asn Ser Ser Val Glu Ala He Glu Gly Ser 
245 250 255 

His Val Ser Leu Leu Cys Gly Ala Asp Ser Asn Pro Pro Pro Leu 
260 265 270 

Leu Thr Trp Met Arg Asp Gly Thr Val Leu Arg Glu Ala Val Ala 
275 280 285 

Glu Ser Leu Leu Leu Glu Leu Glu Glu Val Thr Pro Ala Glu Asp 
290 295 300 

Gly Val Tyr Ala Cys Leu Ala Glu Asn Ala Tyr Gly Gin Asp Asn 
305 310 315 

Arg Thr Val Gly Leu Ser Val Met Tyr Ala Pro Trp Lys Pro Thr 
320 325 330 

Val Asn Gly Thr Met Val Ala Val Glu Gly Glu Thr Val Ser He 
335 340 345 

Leu Cys Ser Thr Gin Ser Asn Pro Asp Pro He Leu Thr He Phe 
350 355 360 

Lys Glu Lys Gin He Leu Ser Thr Val He Tyr Glu Ser Glu Leu 
365 370 375 

Gin Leu Glu Leu Pro Ala Val Ser Pro Glu Asp Asp Gly Glu Tyr 
380 385 390 

Trp Cys Val Ala Glu Asn Gin Tyr Gly Gin Arg Ala Thr Ala Phe 
395 400 405 

Asn Leu Ser Val Glu Phe Ala Pro Val Leu Leu Leu Glu Ser His 
410 415 420 

Cys Ala Ala Ala Arg Asp Thr Val Gin Cys Leu Cys Val Val Lys 
425 430 435 

Ser Asn Pro Glu Pro Ser Val Ala Phe Glu Leu Pro Ser Arg Asn 
440 445 450 

Val Thr Val Asn Glu Ser Glu Arg Glu Phe Val Tyr Ser Glu Arg 
455 460 465 

Ser Gly Leu Val Leu Thr Ser He Leu Thr Leu Arg Gly Gin Ala 
470 475 480 

Gin Ala Pro Pro Arg Val He Cys Thr Ala Arg Asn Leu Tyr Gly 
485 490 495 

Ala Lys Ser Leu Glu Leu Pro Phe Gin Gly Ala His Arg Leu Met 
500 505 510 

Trp Ala Lys He Gly Pro Val Gly Ala Val Val Ala Phe Ala He 
515 520 525 

Leu He Ala He Val Cys Tyr He Thr Gin Thr Arg Arg Lys Lys 
530 535 540 

Asn Val Thr Glu Ser Pro Ser Phe Ser Ala Gly Asp Asn Pro Pro 
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545 550 555 

Val Leu Phe Ser Ser Asp Phe Arg lie Ser Gly Ala Pro Glu Lys 

560 565 570 

Tyr Glu Ser Lys Glu Val Ser Thr Leu Glu Ser His 

575 580 



<210> 14 
<211> 188 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7510226CD1 

<400> 14 

Met Met Glu Glu Arg Ala Ala Ala Ala Val Ala Ala Ala Ala Ser 
15 10 15 

Ser Cys Arg Pro Leu Gly Ser Gly Ala Gly Pro Gly Pro Thr Gly 

20 25 30 

Ala Ala Pro Val Ser Ala Pro Ala Pro Gly Pro Gly Pro Ala Gly 

35 40 45 

Lys Gly Gly Gly Gly Gly Gly Ser Pro Gly Pro Thr Ala Gly Pro 
50 55 60 

Glu Pro Leu Ser Leu Pro Gly lie Leu His Phe lie Gin His Glu 
65 70 75 

Trp Ala Arg Phe Glu Ala Glu Lys Ala Arg Trp Glu Ala Glu Arg 
80 85 90 

Ala Glu Leu Gin Ala Gin Val Ala Phe Leu Gin Gly Glu Arg Lys 
95 100 105 

Gly Gin Glu Asn Leu Lys Thr Asp Leu Val Arg Arg lie Lys Met 

110 115 120 

Leu Glu Tyr Ala Leu Lys Gin Glu Arg Ala Lys Tyr His Lys Leu 

125 130 135 

Lys Phe Gly Thr Asp Leu Asn Gin Gly Glu Lys Lys Ala Asp Val 

140 145 ■ 150 

Ser Glu Gin Val Ser Asn Gly Pro Val Glu Ser Val Thr Leu Glu 

155 160 165 

Asn Ser Pro Leu Val Trp Lys Glu Gly Arg Gin Leu Leu Arg His 

170 175 180 

Pro Val Pro Ala Gly Phe Ser Phe 

185 



<210> 15 
<211> 478 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510385CD1 



<400> 15 

Met Gly Arg Ala Gly Ala Ala Ala Val lie Pro Gly Leu Ala Leu 
1 5 10 15 

Leu Trp Ala Val Gly Leu Gly Ser Ala Ala Pro Ser Pro Pro Arg 
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20 25 30 

Leu Arg Leu Ser Phe Gin Glu Leu Gin Ala Trp His Gly Leu Gin 
35 40 45 

Thr Phe Ser Leu Glu Arg Thr Cys Cys Tyr Gin Ala Leu Leu Val 
50 55 60 

Asp Glu Glu Arg Gly Arg Leu Phe Val Gly Ala Glu Asn His Val 
65 70 75 

Ala Ser Leu Asn Leu Asp Asn lie Ser Lys Arg Ala Lys Lys Leu 
80 85 90 

Ala Trp Pro Ala Pro Val Glu Trp Arg Glu Glu Cys Asn Trp Ala 
95 100 105 

Gly Lys Asp lie Gly Thr Glu Cys Met Asn Phe Val Lys Leu Leu 
110 115 120 

His Ala Tyr Asn Arg Thr His Leu Leu Ala Cys Gly Thr Gly Ala 
125 130 135 

Phe His Pro Thr Cys Ala Phe Val Glu Val Gly His Arg Ala Glu 
140 145 150 

Glu Pro Val Leu Arg Leu Asp Pro Gly Arg lie Glu Asp Gly Lys 
155 160 165 

Gly Lys Ser Pro Tyr Asp Pro Arg His Arg Ala Ala Ser Val Leu 
170 175 180 

Val Gly Glu Glu Leu Tyr Ser Gly Val Ala Ala Asp Leu Met Gly 
185 190 195 

Arg Asp Phe Thr lie Phe Arg Ser Leu Gly Gin Arg Pro Ser Leu 
200 205 210 

Arg Thr Glu Pro His Asp Ser Arg Trp Leu Asn Glu Pro Lys Phe 
215 220 225 

Val Lys Val Phe Trp lie Pro Glu Ser Glu Asn Pro Asp Asp Asp 
230 235 240 

Lys lie Tyr Phe Phe Phe Arg Glu Thr Ala Val Glu Ala Ala Pro 
245 250 255 

Ala Leu Gly Arg Leu Ser Val Ser Arg Val Gly Gin lie Cys Arg 
260 265 270 

Asn Asp Val Gly Gly Gin Arg Ser Leu Val Asn Lys Trp Thr Thr 
275 280 285 

Phe Leu Lys Ala Arg Leu Val Cys Ser Val Pro Gly Val Glu Gly 
290 295 300 

Asp Thr His Phe Asp Gin Leu Arg Glu Cys Gly Ser Gly Tyr Gly 
305 310 315 

Val Gly Glu Gly Gly Ser Gly Ala Asp Ser Gly Ser Pro Arg Arg 
320 325 330 

Ser Gly Ala Val Arg Pro Thr Pro Ala Arg Pro Leu Pro Arg Arg 
335 340 345 

Gly Cys Val Ser Val Val Leu Ala Gly Pro Pro Asp Pro Ala Ala 
350 355 360 

Leu Cys Arg Leu Leu His Val Gin Gin His Leu Pro Gly Leu Cys 
365 370 375 

Gly Val Arg Val Gin His Glu Arg Arg Ala Pro Gly Leu Leu Gly 
380 385 390 

Thr Leu Cys Thr Gin Gly Gly Ala His Ala Pro Val Gly Val He 
395 400 405 

Pro Gly Ser Arg Pro Leu Pro Ala Ala Arg His Val Pro Gin Gin 
410 415 420 

Asp Leu Trp His Leu Gin Phe His Gin Gly Leu Pro Arg Arg Cys 
425 430 435 

His Pro Val Cys Ala Glu Pro Pro Pro His Val Gin Leu Cys Pro 
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440 445 450 

Ala His Trp Gly Ala Pro Ser Phe Pro Thr Ser Trp Ser Gin Leu 

455 460 465 

His Leu His Ser Asn Cys Arg Gly Pro Gly Cys Ser Arg 

470 475 

<210> 16 
<211> 412 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7511618CD1 

<400> 16 

Met Ala Glu Ser Gly Glu Ser Gly Gly Pro Pro Gly Ser Gin Asp 
15 10 15 

Ser Ala Ala Gly Ala Glu Gly Ala Gly Ala Pro Ala Ala Ala Ala 
20 25 30 

Ser Ala Glu Pro Lys lie Met Lys Val Thr Val Lys Thr Pro Lys 
35 40 45 

Glu Lys Glu Glu Phe Ala Val Pro Glu Asn Ser Ser Val Gin Gin 
50 55 60 

Thr Leu Glu Leu Ala Arg Asn Pro Ala Met Met Gin Glu Met Met 
65 70 75 

Arg Asn Gin Asp Arg Ala Leu Ser Asn Leu Glu Ser He Pro Gly 
80 85 90 

Gly Tyr Asn Ala Leu Arg Arg Met Tyr Thr Asp He Gin Glu Pro 
95 100 105 

Met Leu Ser Ala Ala Gin Glu Gin Phe Gly Gly Asn Pro Phe Ala 

110 115 120 

Ser Leu Val Ser Asn Thr Ser Ser Gly Glu Gly Ser Gin Pro Ser 

125 130 135 

Arg Thr Glu Asn Arg Asp Pro Leu Pro Asn Pro Trp Ala Pro Gin 

140 145 150 

Thr Ser Gin Ser Ser Ser Ala Ser Ser Gly Thr Ala Ser Thr Val 

155 160 165 

Gly Gly Thr Thr Gly Ser Thr Ala Ser Gly Thr Ser Gly Gin Ser 

170 175 180 

Thr Thr Ala Pro Asn Leu Val Pro Gly Val Gly Ala Ser Met Phe 

185 190 195 

Asn Thr Pro Gly Met Gin Ser Leu Leu Gin Gin He Thr Glu Asn 

200 205 210 

Pro Gin Leu Met Gin Asn Met Leu Ser Ala Pro Tyr Met Arg Ser 

215 220 225 

Met Met Gin Ser Leu Ser Gin Asn Pro Asp Leu Ala Ala Gin Met 

230 235 240 

Met Leu Asn Asn Pro Leu Phe Ala Gly Asn Pro Gin Leu Gin Glu 

245 250 255 

Gin Met Arg Gin Gin Leu Pro Thr Phe Leu Gin Gin Met Gin Asn 

260 265 270 

Pro Asp Thr Leu Ser Ala Met Ser Asn Pro Arg Ala Met Gin Ala 

275 280 285 

Leu Leu Gin He Gin Gin Gly Leu Gin Thr Leu Ala Thr Glu Ala 

290 295 300 
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Pro Gly Leu He Pro Gly Phe Thr Pro Gly Leu Gly Ala Leu Gly 

305 310 315 

Ser Thr Gly Gly Ser Ser Gly Thr Asn Gly Ser Asn Ala Thr Pro 

320 325 330 

Ser Glu Asn Thr Ser Pro Thr Ala Gly Thr Thr Glu Pro Gly His 

335 340 345 

Gin Gin Phe He Gin Gin Met Leu Gin Ala Leu Ala Gly Val Asn 

350 355 360 

Pro Gin Leu Gin Asn Pro Glu Val Arg Phe Gin Gin Gin Leu Glu 

365 370 375 

Gin Leu Ser Ala Met Gly Phe Leu Asn Arg Glu Ala Asn Leu Gin 

380 385 390 

Ala Leu He Ala Thr Gly Gly Asp He Asn Ala Ala He Glu Arg 

395 400 405 

Leu Leu Gly Ser Gin Pro Ser 

410 

<210> 17 
<211> 136 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6244135CD1 

<400> 17 

Met Met Asp Gin Glu Glu Lys Thr Glu Glu Gly Ser Gly Pro Cys 
1 5 10 15 

Ala Glu Ala Gly Ser Pro Asp Gin Glu Gly Phe Phe Asn Leu Leu 

20 25 30 

Ser His Val Gin Gly Asp Arg Met Glu Gly Gin Arg Cys Ser Leu 

35 40 45 

Gin Ala Gly Pro Gly Gin Thr Thr Lys Ser Gin Ser Asp Pro Thr 

50 55 60 

Pro Glu Met Asp Ser Leu Met Asp Met Leu Ala Ser Thr Gin Gly 

65 70 75 

Arg Arg Met Asp Asp Gin Arg Val Thr Val Ser Ser Leu Pro Gly 

80 85 90 

Phe Gin Pro Val Gly Ser Lys Asp Gly Ala Gin Lys Arg Ala Gly 

95 100 105 

Thr Leu Ser Pro Gin Pro Leu Leu Thr Pro Gin Asp Pro Thr Ala 
110 115 120 

Leu Gly Phe Arg Arg Asn Ser Ser Pro Gin Pro Pro Thr Gin Ala 
125 130 135 

Pro 



<210> 18 
<211> 198 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7506689CD1 
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<400> 18 

Met Pro Ala Ser Ala Ala Arg Pro Arg Pro Gly Pro Gly Gin Pro 
15 10 15 

Thr Ala Ser Pro Phe Pro Leu Leu Leu Leu Ala Val Leu Ser Gly 
20 25 30 

Pro Val Ser Gly Arg Val Pro Arg Ser Val Pro Arg Thr Ser Leu 
35 40 45 

Pro lie Ser Glu Ala Asp Ser Cys Leu Thr Arg Phe Ala Val Pro 
50 55 60 

His Thr Tyr Asn Tyr Ser Val Leu Leu Val Asp Pro Ala Ser His 
65 70 75 

Thr Leu Tyr Val Gly Ala Arg Asp Thr lie Phe Ala Leu Ser Leu 
80 85 ■ 90 

Pro Phe Ser Gly Glu Arg Pro Arg Arg lie Asp Trp Met Val Pro 
95 100 105 

Glu Ala His Arg Gin Asn Cys Arg Lys Lys Gly Lys Lys Glu Gly 
110 115 120 

Asp Leu Gly Gly Arg Lys Thr Leu Gin Gin Arg Trp Thr Thr Phe 
125 130 135 

Leu Lys Ala Asp Leu Leu Cys Pro Gly Leu Ser Met Ala Gly Pro 
140 145 150 

Pro Val Ser Cys Arg Met Leu Leu Cys Phe Asp Leu Ser Leu Gly 
155 160 165 

Gin Gly Leu Pro Ser Phe Met Ala Ser Phe Leu Pro Ser Gly Arg 
170 175 180 

Gly Leu Leu Ser Leu Leu Ser Val Pro Ser Asp His Lys Thr Phe 
185 190 195 

Gly Gin Cys 



<210> 19 
<211> 813 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7510185CD1 

<400> 19 

Met Ala Val Gin Arg Ala Ala Ser Pro Arg Arg Pro Pro Ala Pro 
15 10 15 

Leu Trp Pro Arg Leu Leu Leu Pro Leu Leu Leu Leu Leu Leu Pro 

20 25 30 

Ala Pro Ser Glu Gly Leu Gly His Ser Ala Glu Leu Ala Phe Ala 

35 40 45 

Val Glu Pro Ser Asp Asp Val Ala Val Pro Gly Gin Pro lie Val 

50 55 60 

Leu Asp Cys Arg Val Glu Gly Thr Pro Pro Val Arg lie Thr Trp 

65 70 75 

Arg Lys Asn Gly Val Glu Leu Pro Glu Ser Thr His Ser Thr Leu 

80 85 90 

Leu Ala Asn Gly Ser Leu Met lie Arg His Phe Arg Leu Glu Pro 

95 100 105 

Gly Gly Ser Pro Ser Asp Glu Gly Asp Tyr Glu Cys Val Ala Gin 
110 115 120 



35/65 



WO 03/051902 



PCT/US02/40059 



Asn Arg Phe Gly Leu Val Val Ser Arg Lys Ala Arg lie Gin Ala 
125 130 135 

Ala Thr Met Ser Asp Phe His Val His Pro Gin Ala Thr Val Gly 
140 145 150 

Glu Glu Gly Gly Val Ala Arg Phe Gin Cys Gin He His Gly Leu 
155 160 165 

Pro Lys Pro Leu He Thr Trp Glu Lys Asn Arg Val Pro He Asp 
170 175 180 

Thr Asp Asn Glu Arg Tyr Thr Leu Leu Pro Lys Gly Val Leu Gin 
185 190 195 

He Thr Gly Leu Arg Ala Glu Asp Gly Gly He Phe His Cys Val 
200 205 210 

Ala Ser Asn He Ala Ser He Arg He Ser His Gly Ala Arg Leu 
215 220 225 

Thr Val Ser Gly Ser Gly Ser Gly Ala Tyr Lys Glu Pro Ala He 
230 235 240 

Leu Val Gly Pro Glu Asn Leu Thr Leu Thr Val His Gin Thr Ala 
245 250 255 

Val Leu Glu Cys Val Ala Thr Gly Asn Pro Arg Pro He Val Ser 
260 265 270 

Trp Ser Arg Leu Asp Gly Arg Pro He Gly Val Glu Gly He Gin 
275 280 285 

Val Leu Gly Thr Gly Asn Leu He He Ser Asp Val Thr Val Gin 
290 295 300 

His Ser Gly Val Tyr Val Cys Ala Ala Asn Arg Pro Gly Thr Arg 
305 310 315 

Val Arg Arg Thr Ala Gin Gly Arg Leu Val Val Gin Ala Pro Ala 
320 325 330 

Glu Phe Val Gin His Pro Gin Ser He Ser Arg Pro Ala Gly Thr 
335 340 345 

Thr Ala Met Phe Thr Cys Gin Ala Gin Gly Glu Pro Pro Pro His 
350 355 360 

Val Thr Trp Leu Lys Asn Gly Gin Val Leu Gly Pro Gly Gly His 
365 370 375 

Val Arg Leu Lys Asn Asn Asn Ser Thr Leu Thr He Ser Gly He 
380 385 390 

Gly Pro Glu Asp Glu Ala He Tyr Gin Cys Val Ala Glu Asn Ser 
395 400 405 

Ala Gly Ser Ser Gin Ala Ser Ala Arg Leu Thr Val Leu Trp Ala 
410 415 420 

Glu Gly Leu Pro Gly Pro Pro Arg Asn Val Arg Ala Val Ser Val 
425 430 435 

Ser Ser Thr Glu Val Arg Val Ser Trp Ser Glu Pro Leu Ala Asn 
440 445 450 

Thr Lys Glu He He Gly Tyr Val Leu His He Arg Lys Ala Ala 
455 460 465 

Asp Pro Pro Glu Leu Glu Tyr Gin Glu Ala Val Ser Lys Ser Thr 
470 475 480 

Phe Gin His Leu Val Ser Asp Leu Glu Pro Ser Thr Ala Tyr Ser 
485 490 495 

Phe Tyr He Lys Ala Tyr Thr Pro Arg Gly Ala Ser Ser Ala Ser 
500 505 510 

Val Pro Thr Leu Ala Ser Thr Leu Gly Glu Ala Pro Ala Pro Pro 
515 520 525 

Pro Leu Ser Val Arg Val Leu Gly Ser Ser Ser Leu Gin Leu Leu 
530 535 540 
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Trp Glu Pro Trp Pro Arg Leu Ala Gin His Glu Gly Gly Phe Lys 

545 550 555 

Leu Phe Tyr Arg Pro Ala Ser Lys Thr Ser Phe Thr Gly Pro He 

560 565 570 

Leu Leu Pro Gly Thr Val Ser Ser Tyr Asn Leu Ser Gin Leu Asp 

575 580 585 

Pro Thr Ala Val Tyr Glu Val Lys Leu Leu Ala Tyr Asn Gin His 

590 595 600 

Gly Asp Gly Asn Ala Thr Val Arg Phe Val Ser Leu Arg Gly Ala 

605 610 615 

Ser Glu Arg Thr Ala Leu Ser Pro Pro Cys Asp Cys Arg Lys Glu 

620 625 630 

Glu Ala Ala Asn Gin Thr Ser Thr Thr Gly He Val He Gly He 

635 640 645 

His He Gly Val Thr Cys He He Phe Cys Val Leu Phe Leu Leu 

650 655 660 

Phe Gly Gin Arg Gly Arg Val Leu Leu Cys Lys Asp Val Glu Asn 

665 670 675 

Gin Leu Ser Pro Pro Gin Gly Pro Arg Ser Gin Arg Asp Pro Gly 

680 685 690 

He Leu Ala Leu Asn Gly Ala Arg Arg Gly Gin Arg Gly Gin Leu 

695 700 705 

Gly Arg Asp Glu Lys Arg Val Asp Met Lys Glu Leu Glu Gin Leu 

710 715 720 

Phe Pro Pro Ala Ser Ala Ala Gly Gin Pro Asp Pro Arg Pro Thr 

725 730 735 

Asp Pro Ala Ala Pro Ala Pro Cys Glu Glu Thr Gin Leu Ser Leu 

740 745 750 

Leu Pro Leu Gin Gly Cys Gly Leu Met Glu Gly Lys Thr Thr Glu 

755 760 765 

Ala Lys Thr Thr Glu Ala Thr Ala Pro Cys Ala Gly Leu Ala Ala 

770 775 780 

Ala Pro Pro Pro Pro Asp Gly Gly Pro Gly Leu Leu Ser Glu Gly 

785 790 795 

Gin Ala Ser Arg Pro Ala Ala Ala Arg Val Thr Gin Pro Ala His 

800 805 810 

Ser Glu Gin 



<210> 20 

<211> 165 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> itiisc_feature 

<223> Incyte ID No: 1420867CD1 

<400> 20 

Met Trp He Asp He Phe Pro Gin Asp Val Pro Ala Pro Pro Pro 
15 10 15 

Val Asp He Lys Pro Arg Gin Pro He Ser Tyr Glu Leu Arg Val 

20 25 30 

Val He Trp Asn Thr Glu Asp Val Val Leu Asp Asp Glu Asn Pro 

35 40 45 

Leu Thr Gly Glu Met Ser Ser Asp He Tyr Val Lys Ser Trp Val 
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50 



55 



60 



Lys Gly Leu Glu His Asp Lys Gin Glu Thr Asp Val His Phe Asn 

65 70 75 

Ser Leu Thr Gly Glu Gly Asn Phe Asn Trp Arg Phe Val Phe Arg 

80 85 90 

Phe Asp Tyr Leu Pro Thr Glu Arg Glu Val Ser Val Arg Arg Arg 

95 100 105 

Ser Gly Pro Phe Ala Leu Glu Glu Ala Glu Phe Arg Gin Pro Ala 

110 115 120 

Val Leu Val Leu Gin Asp Pro Trp Ser Cys Ser Tyr Gin Thr Trp 

125 130 135 

Cys Met Gly Pro Gly Ala Pro Ser Ser Ala Leu Cys Ser Trp Pro 

140 145 150 

Ala Met Gly Pro Gly Arg Gly Ala lie Cys Phe Ala Ala Ala Ala 



<210> 21 
<211> 229 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7512289CD1 

<400> 21 

Met Val Lys Val Thr Phe Asn Ser Ala Leu Ala Gin Lys Glu Ala 
15 10 15 

Lys Lys Asp Glu Pro Lys Ser Gly Glu Glu Ala Leu lie He Pro 
20 25 30 

Pro Asp Ala Val Ala Val Asp Cys Lys Asp Pro Asp Asp Val Val 
35 40 45 

Pro Val Gly Gin Arg Arg Ala Trp Cys Trp Cys Met Cys Phe Gly 
50 55 60 

Leu Ala Phe Met Leu Ala Gly Val He Leu Gly Gly Ala Tyr Leu 
65 70 75 

Tyr Lys Tyr Phe Ala Leu Gin Pro Asp Asp Val Tyr Tyr Cys Gly 
80 85 90 

He Lys Tyr He Lys Asp Asp Val He Leu Asn Glu Pro Ser Ala 
95 100 105 

Asp Ala Pro Ala Ala Leu Tyr Gin Thr He Glu Glu Asn He Lys 
110 115 120 

He Phe Glu Glu Glu Glu Val Glu Phe He Ser Val Pro Val Pro 
125 130 135 

Glu Phe Ala Asp Ser Asp Pro Ala Asn He Val His Asp Phe Asn 
140 145 150 

Lys Ala Gly Thr Tyr Leu Pro Gin Ser Tyr Leu He His Glu His 
155 160 165 

Met Val He Thr Asp Arg He Glu Asn He Asp His Leu Gly Phe 
170 175 180 

Phe He Tyr Arg Leu Cys His Asp Lys Glu Thr Tyr Lys Leu Gin 
185 190 195 

Arg Arg Glu Thr He Lys Gly He Gin Lys Arg Glu Ala Ser Asn 
200 205 210 

Cys Phe Ala He Arg His Phe Glu Asn Lys Phe Ala Val Glu Thr 



155 



160 



165 
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215 



220 



225 



Leu lie Cys Ser 



<210> 22 

<211> 1163 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7512447CD1 

<400> 22 

Met Val Trp Cys Leu Gly Leu Ala Val Leu Ser Leu Val lie Ser 
1 * ~ 5 10 15 

Gin Gly Ala Asp Gly Arg Gly Lys Pro Glu Val Val Ser Val Val 
20 25 30 

Gly Arg Ala Gly Glu Ser Val Val Leu Gly Cys Asp Leu Leu Pro 
35 40 45 

Pro Ala Gly Arg Pro Pro Leu His Val He Glu Trp Leu Arg Phe 
50 55 60 

Gly Phe Leu Leu Pro He Phe He Gin Phe Gly Leu Tyr Ser Pro 
65 70 75 

Arg He Asp Pro Asp Tyr Val Gly Arg Val Arg Leu Gin Lys Gly 
80 85 90 

Ala Ser Leu Gin He Glu Gly Leu Arg Val Glu Asp Gin Gly Trp 
95 100 105 

Tyr Glu Cys Arg Val Phe Phe Leu Asp Gin His He Pro Glu Asp 
110 115 120 

Asp Phe Ala Asn Gly Ser Trp Val Arg Leu Thr Val Asn Ser Pro 
125 130 135 

Pro Gin Phe Gin Glu Thr Pro Pro Ala Val Leu Glu Val Gin Glu 
140 145 150 

Leu Glu Pro Val Thr Leu Arg Cys Val Ala Arg Gly Ser Pro Leu 
155 160 165 

Pro His Val Thr Trp Lys Leu Arg Gly Lys Asp Leu Gly Gin Gly 
170 175 180 

Gin Gly Gin Val Gin Val Gin Asn Gly Thr Leu Arg He Arg Arg 
185 190 195 

Val Glu Arg Gly Ser Ser Gly Val Tyr Thr Cys Gin Ala Ser Ser 
200 205 210 

Thr Glu Gly Ser Ala Thr His Ala Thr Gin Leu Leu Val Leu Gly 
215 220 225 

Pro Pro Val He Val Val Pro Pro Lys Asn Ser Thr Val Asn Ala 
230 235 240 

Ser Gin Asp Val Ser Leu Ala Cys His Ala Glu Ala Tyr Pro Ala 
245 250 255 

Asn Leu Thr Tyr Ser Trp Phe Gin Asp Asn He Asn Val Phe His 
260 265 270 

He Ser Arg Leu Gin Pro Arg Val Arg He Leu Val Asp Gly Ser 
275 280 285 

Leu Arg Leu Leu Ala Thr Gin Pro Asp Asp Ala Gly Cys Tyr Thr 
290 295 300 

Cys Val Pro Ser Asn Gly Leu Leu His Pro Pro Ser Ala Ser Ala 



305 



310 



315 
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Tyr Leu Thr Val Leu Cys Met Pro Qly Val He Arg Cys Pro Val 
320 325 330 

Arg Ala Asn Pro Pro Leu Leu Phe Val Ser Trp Thr Lys Asp Gly 
335 340 345 

Lys Ala Leu Gin Leu Asp Lys Phe Pro Gly Trp Ser Gin Gly Thr 
350 355 360 

Glu Gly Ser Leu He He Ala Leu Gly Asn Glu Asp Ala Leu Gly 
365 370 375 

Glu Tyr Ser Cys Thr Pro Tyr Asn Ser Leu Gly Thr Ala Gly Pro 
380 385 390 

Ser Pro Val Thr Arg Val Leu Leu Lys Ala Pro Pro Ala Phe He 
395 400 405 

Glu Arg Pro Lys Glu Glu Tyr Phe Gin Glu Val Gly Arg Glu Leu 
410 415 420 

Leu He Pro Cys Ser Ala Gin Gly Asp Pro Pro Pro Val Val Ser 
425 430 435 

Trp Thr Lys Val Gly Arg Gly Leu Gin Gly Gin Ala Gin Val Asp 
440 445 450 

Ser Asn Ser Ser Leu He Leu Arg Pro Leu Thr Lys Glu Ala His 
455 460 465 

Gly His Trp Glu Cys Ser Ala Ser Asn Ala Val Ala Arg Val Ala 
470 475 480 

Thr Ser Thr Asn Val Tyr Val Leu Gly Thr Ser Pro His Val Val 
485 490 495 

Thr Asn Val Ser Val Val Ala Leu Pro Lys Gly Ala Asn Val Ser 
500 505 510 

Trp Glu Pro Gly Phe Asp Gly Gly Tyr Leu Gin Arg Phe Ser Val 
515 520 525 

Trp Tyr Thr Pro Leu Ala Lys Arg Pro Asp Arg Met His His Asp 
530 535 540 

Trp Val Ser Leu Ala Val Pro Val Gly Ala Ala His Leu Leu Val 
545 550 555 

Pro Gly Leu Gin Pro His Thr Gin Tyr Gin Phe Ser Val Leu Ala 
560 565 570 

Gin Asn Lys Leu Gly Ser Gly Pro Phe Ser Glu He Val Leu Ser 
575 580 585 

Ala Pro Glu Gly Leu Pro Thr Thr Pro Ala Ala Pro Gly Leu Pro 
590 595 600 

Pro Thr Glu He Pro Pro Pro Leu Ser Pro Pro Arg Gly Leu Val 
605 610 615 

Ala Val Arg Thr Pro Arg Gly Val Leu Leu His Trp Asp Pro Pro 
620 625 630 

Glu Leu Val Pro Lys Arg Leu Asp Gly Tyr Val Leu Glu Gly Arg 
635 640 645 

Gin Gly Ser Gin Gly Trp Glu Val Leu Asp Pro Ala Val Ala Gly 
650 655 660 

Thr Glu Thr Glu Leu Leu Val Pro Gly Leu He Lys Asp Val Leu 
665 670 675 

Tyr Glu Phe Arg Leu Val Ala Phe Ala Gly Ser Phe Val Ser Asp 
680 685 690 

Pro Ser Asn Thr Ala Asn Val Ser Thr Ser Gly Leu Glu Val Tyr 
695 700 705 

Pro Ser Arg Thr Gin Leu Pro Gly Leu Leu Pro Gin Pro Val Leu 
710 715. 720 

Ala Gly Val Val Gly Gly Val Cys Phe Leu Gly Val Ala Val Leu 
725 730 735 
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Val Ser He Leu Ala Gly Cys Leu Leu Asn Arg Arg Arg Ala Ala 
740 745 750 

Arg Arg Arg Arg Lys Arg Leu Arg Gin Asp Pro Pro Leu He Phe 
755 760 765 

Ser Pro Thr Gly Lys Ser Ala Ala Pro Ser Ala Leu Gly Ser Gly 
770 775 780 

Ser Pro Asp Ser Val Ala Lys Leu Lys Leu Gin Gly Ser Pro Val 
785 790 795 

Pro Ser Leu Arg Gin Ser Leu Leu Trp Gly Asp Pro Ala Gly Thr 
800 805 810 

Pro Ser Pro His Pro Asp Pro Pro Ser Ser Arg Gly Pro Leu Pro 
815 820 825 

Leu Glu Pro He Cys Arg Gly Pro Asp Gly Arg Phe Val Met Gly 
830 835 840 

Pro Thr Val Ala Ala Pro Gin Glu Arg Ser Gly Arg Glu Gin Ala 
845 850 855 

Glu Pro Arg Thr Pro Ala Gin Arg Leu Ala Arg Ser Phe Asp Cys 
860 865 870 

Ser Ser Ser Ser Pro Ser Gly Ala Pro Gin Pro Leu Cys He Glu 
875 880 885 

Asp lie Ser Pro Val Ala Pro Pro Pro Ala Ala Pro Pro Ser Pro 
890 895 900 

Leu Pro Gly Pro Gly Pro Leu Leu Gin Tyr Leu Ser Leu Pro Phe 
905 910 915 

Phe Arg Glu Met Asn Val Asp Gly Asp Trp Pro Pro Leu Glu Glu 
920 925 930 

Pro Ser Pro Ala Ala Pro Pro Asp Tyr Met Asp Thr Arg Arg Cys 
935 940 945 

Pro Thr Ser Ser Phe Leu Arg Ser Pro Glu Thr Pro Pro Val Ser 
950 955 960 

Pro Arg Glu Ser Leu Pro Gly Ala Val Val Gly Ala Gly Ala Thr 
965 970 975 

Ala Glu Pro Pro Tyr Thr Ala Leu Ala Asp Trp Thr Leu Arg Glu 
980 985 990 

Arg Leu Leu Pro Gly Leu Leu Pro Ala Ala Pro Arg Gly Ser Leu 
995 1000 1005 

Thr Ser Gin Ser Ser Gly Arg Gly Ser Ala Ser Phe Leu Arg Pro 
1010 1015 1020 

Pro Ser Thr Ala Pro Ser Ala Gly Gly Ser Tyr Leu Ser Pro Ala 
1025 1030 1035 

Pro Gly Asp Thr Ser Ser Trp Ala Ser Gly Pro Glu Arg Trp Pro 
1040 1045 1050 

Arg Arg Glu His Val Val Thr Val Ser Lys Arg Arg Asn Thr Ser 
1055 1060 1065 

Val Asp Glu Asn Tyr Glu Trp Asp Ser Glu Phe Pro Gly Asp Met 
1070 1075 1080 

Glu Leu Leu Glu Thr Leu His Leu Gly Leu Ala Ser Ser Arg Leu 
1085 1090 1095 

Arg Pro Glu Ala Glu Pro Glu Leu Gly Val Lys Thr Pro Glu Glu 
1100 1105 1110 

Gly Cys Leu Leu Asn Thr Ala His Val Thr Gly Pro Glu Ala Arg 
1115 1120 1125 

Cys Ala Ala Leu Arg Glu Glu Phe Leu Ala Phe Arg Arg Arg Arg 
1130 1135 1140 

Asp Ala Thr Arg Ala Arg Leu Pro Ala Tyr Arg Gin Pro Val Pro 
1145 1150 1155 
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His Pro Glu Gin Ala Thr Leu Leu 
1160 

<210> 23 

<211> 2208 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_ feature 

<223> Incyte ID No: 7506758CB1 

<400> 23 

gaagcttgag aaaatctttc tcttttatat gtaaaacagt gcagaggata tctagcaagt 60 
attaaaattt catggaaggg ggcagagatt aagaaaaatg tgaaaaggct ggctgcagct 120 
ggcctccgcc gctcatctgc agggcgtgag cgcttggtcc atgcagtgaa gctcttccaa 180 
cctgggtcaa cgaaaacgga gaagaaatgg cccaagaaat agatctgagt gctctcaagg 240 
agttagaacg cgaggccatt ctccaggtcc tgtaccgaga ccaggcggtt caaaacacag 300 
aggaggagag gacacggaaa ctgaaaacac acctgcagca tctccggtgg aaaggagcga 360 
agaacacgga ctgggagcac aaagagaagt gctgtgcgcg ctgccagcag gtgctggggt 420 
tcctgctgca ccggggcgcc gtgtgccggg gctgcagcca ccgcgtgtgt gcccagtgcc 480 
gagtgttcct gagggggacc catgcctgga agtgcacggt gtgcttcgag gacaggaatg 540 
tcaaaataaa aactggagaa tggttctatg aggaacgagc caagaaattt ccaactggag 600 
aaggtcagtg caccagatat tctgaaacct ctcaatcaag aggatcccaa atgctctact 660 
aaccctattt tgaagcaaca gaatctccca tccagtccgg cacccagtac catattctct 720 
ggaggtttta gacacggaag tttaattagc attgacagca cctgtacaga gatgggcaat 780 
tttgacaatg ctaatgtcac tggagaaata gaatttgcca ttcattattg cttcaaaacc 840 
cattctttag aaatatgcat caaggcctgt aagaaccttg cctatggaga agaaaagaag 900 
aaaaagtgca atccgtatgt gaagacctac ctgttgcccg acagatcctc ccagggaaag 960 
cgcaagactg gagtccaaag gaacaccgtg gacccgacct ttcaggagac cttgaagtat 1020 
caggtggccc ctgcccagct ggtgacccgg cagctgcagg tctcggtgtg gcatctgggc 1080 
acgctggccc ggagagtgtt tcttggagaa gtgatcattc ctctggccac gtgggacttt 1140 
gaagacagca caacacagtc cttccgctgg catccgctcc gggccaaggc ggagaaatac 1200 
gaagacagcg ttcctcagag taatggagag ctcacagtcc gggctaagct ggttctccct 1260 
tcacggccca gaaaactcca agaggctcaa gaagggacag atcagccatc acttcatggt 1320 
caactttgtt tggtagtgct aggagccaag. aatttacctg tgcggccaga tggcaccttg 1380 
aactcatttg ttaagggctg tctcactctg ccagaccaac aaaaactgag actgaagtcg 1440 
ccagtcctga ggaagcaggc ttgcccccag tggaaacact catttgtctt cagtggcgta 1500 
accccagctc agctgaggca gtcaagcttg gagttaactg tctgggatca ggccctcttt 1560 
ggaatgaacg accgcttgct tggaggaacc agacttggtt caaagggaga cacagctgtt 1620 
ggcggggatg catgctcact atcgaagctc cagtggcaga aagtcctttc cagccccaat 1680 
ctatggacag acatgactct tgtcctgcac tgacatgaag gcctcaaggt tccaggttgc 1740 
agcaggcgtg aggcactgtg cgtctgcaga ggggctacga accaggtgca gggtcccagc 1800 
tggagacccc tttgaccttg agcagtctcc • atctgcggcc ctgtcccatg gcttaaccgc 1860 
ctattggtat ctgtgtatat ttacgttaaa cacaattatg ttacctaagc ctctggtggg 1920 
ttatctcctc tttgagatgt agaaaatggc cagattttaa taaacgttgt tacccatgtc 1980 
aaaaaggaga taacccacca gaggcttagg taacataatt gtgtttaacg taaatataca 2040 
cagataccaa taggcggtta agccatggga cagggccgca gatggagact gctcaaggtc 2100 
aaaggggtct ccagctggga ccctgcacct ggttcgtagc ccctctgcag acgcacagtg 2160 
cctcacgcct gctgcaacct ggaaccttga ggccttcatg tcaatgca 2208 

<210> 24 

<211> 7611 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc__f eature 

<223> Incyte ID No: 4381669CB1 

<400> 24 

gaaacattgc cattgatgca tctggccact gtgttcagtg gacatggtat gttgtagaag 60 
ttatatttct ctagaagaaa ttttcatatg gacagtgttt tcagcatcta tttcagtatg 120 
catttccaaa tgagaatact gggacataat gcatatgctc ttcaacaagc tgcctttgtt 180 
tgcctacagc tttatagcag gccaaatcga ctcttgtctg gtgtgccact gtacaaaggg 240 
gacgtgccaa cccaagataa atctgcagtt acatctttgc tggatggact gaaccaagcc 300 
ttcgaggagg tttcatccca gagtggaggg gcaaagaggc agagtataca cttggagcag 360 
aagttgtatg atggagtctc agccacctct acttggttgg atgacgttga agaacgttta 420 
tttgttgcca cagcactttt accagaagaa acagagactt gtctcttcaa ccaagagatt 480 
cttgccaaag acattaagga aatgtctgaa gaaatggata agaacaaaaa cttgttttcc 540 
caagcttttc cagagaatgg tgataatcga gatgttattg aagatacttt gggttgtctt 600 
ttgggcaggt tatccttgct agactcagta gtgaatcaac gatgtcatca gatgaaagaa 660 
agacttcagc aaatactaaa tttccagaat gatctgaaag tgctgtttac atcactggct 720 
gacaacaaat acatcattct gcaaaaactg gcaaatgtgt ttgaacagcc cgtagcagaa 780 
caaatagagg caatacaaca ggctgaagat ggactcaaag aatttgatgc aggaatcatt 840 
gaattaaaga ggcgtggtga caagctacag gtcgagcagc cgtccatgca agaactctcc 900 
aagctccagg acatgtatga tgagctgatg atgatcattg gctcccggag gagtggtctg 960 
aatcagaacc ttacactcaa gagtcagtat gagagggccc tacaagatct ggctgacctg 1020 
ctagaaactg gtcaggagaa gatggcagga gaccagaaaa tcatcgtgtc ttccaaagag 1080 
gaaatccagc aactacttga caaacataag gaatactttc agggcctgga atctcatatg 1140 
atcttgactg aaacactctt cagaaagata atcagctttg cagtccaaaa ggaaacccag 1200 
ttccatacag agctgatggc tcaggcttct gctgtactga aacgggctca caagaggggt 1260 
gtggagctgg agtacattct agagacgtgg tcccatctgg atgaggacca gcaggagctc 1320 
agcagacagc tggaggtggt ggaaagcagc atcccaagcg tgggtctggt ggaggagaac 1380 
gaggacaggc ttattgaccg cataacactc taccagcatt taaaatctag ccttaatgaa 1440 
taccagccca aattatatca agtattagat gatgggaaac gacttctgat atccatcagc 1500 
tgctcagatc tagaaagcca actaaatcaa cttggagagt gctggctaag taacaccaat 1560 
aaaatgtcta aggaacttca cagactggaa acaatattga aacactggac cagatatcaa 1620 
agtgaatctg cagatctaat tcactggtta caatctgcaa aagaccggct agaattttgg 1680 
actcagcaat ctgtgacagt cccacaagag ctggaaatgg tccgtgatca tctaaatgct 1740 
ttcctggagt tttctaaaga agtggatgcc caatcttccc tgaaatcatc tgttctgagt 1800 
actggaaatc agctccttcg actaaaaaag gtggacacag ccacgctgcg ctctgagctg 1860 
tcgcgcattg atagccagtg gactgacctg ctaaccaata tcccagccgt ccaggagaag 1920 
ctccaccagc tccagatgga taaactgcct tcccgccatg ccatttcfcga agtcatgagt 1980 
tggatttctc taatggaaaa tgttattcag aaggatgaag ataatattaa aaattccata 2040 
ggttacaagg caattcatga ataccttcag aaatataagg gttttaagat agacattaac 2100 
tgtaaacagc tgacagtgga ttttgtgaac cagtccgtgc tacaaatcag cagtcaggat 2160 
gtggaaagta agcgtagtga taagactgat tttgctgagc aacttggagc aatgaataaa 2220 
agttggcaaa ttctgcaagg tctagtaact gagaagatcc agctgttgga aggcttattg 2280 
gaatcttggt cagaatatga aaataatgta caatgtctga aaacatggtt tgaaacccag 2340 
gaaaagagac taaaacaaca gcatcgaatt ggagatcagg cttctgttca aaatgcactg 2400 
aaagactgtc aggatctgga agatttgatt aaagcaaaag aaaaagaagt agagaaaatt 2460 
gagcagaatg gacttgcttt gattcagaac aagaaagaag acgtctctag cattgtcatg 2520 
agcacactgc gagagctcgg ccaaacctgg gcaaatttag atcacatggt tggacaatta 2580 
aagatactgc tgaaatcagt gcttgaccaa tggagtagtc acaaagtggc ctttgacaag 2640 
ataaacagtt acctcatgga ggccagatac tctctttccc gattccgtct gctgactggc 2700 
tccttagaag ctgtgcaagt tcaggtggac aatcttcaga atctccaaga tgatctggaa 2760 
aaacaggaaa ggagcttaca gaaatttggc tctatcacca accaattatt aaaagagtgt 2820 
cacccacccg tgacagaaac tcttaccaat acactgaaag aagtcaacat gagatggaat 2880 
aacttgctgg aagagattgc tgagcagcta cagtccagca aggccctact tcagctttgg 2940 
caaagataca aggactactc caaacagtgt gcttcgacag ttcagcagca ggaggatcga 3000 
accaatgagc tgttgaaggc agccacaaac aaggacattg ccgatgatga ggttgccaca 3060 
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tggattcaag attgcaacga cctcctcaaa ggactgggca cagttaaaga ttccctcttt 3120 
tttctccatg agctgggaga gcaactgaag caacaagtgg atgcttccgc agcatcagct 3180 
attcaatcgg atcaactctc tttgagtcaa cacttgtgtg ccctggagca agctctctgc 3240 
aaacagcaga cttcattaca ggctggagtt cttgattatg aaacctttgc caagagttta 3300 
gaagctttgg aggcctggat agtggaagct gaagaaatac tacaagggca ggaccctagc 3360 
cactcatctg acctctccac aatccaggaa aggatggaag aacttaaggg acagatgtta 3420 
aaattcagca gcatggctcc agatttagac cgtctaaatg agcttggata taggttaccc 3480 
ttgaatgata aggaaatcaa aagaatgcag aatctgaacc gccattggtc tctgatctcc 3540 
tctcagacta cagaaagatt cagcaagttg cagtcatttt tgctacaaca tcagactttc 3600 
ttggaaaaat gtgaaacatg gatggaattc ctagttcaga cagaacaaaa gttagcagta 3660 
gagatttcag gaaattatca gcaccttttg gaacagcaga gagcacacga ggtattgttt 3720 
caagccgaga tgttcagtcg tcagcagatt ttgcactcaa tcattattga tgggcaacgt 3780 
cttctagaac aaggtcaagt tgatgacagg gatgaattca acctgaaatt gacactcctc 3840 
agtaatcaat ggcagggagt gattcgcagg gcccagcaga ggcgggggat cattgacagc 3900 
cagattcgcc agtggcagcg ctatagggag atggcagaaa agcttcgtaa atggttggtt 3960 
gaagtgtccfc acctccccat gagtggtctc ggaagtgttc ctataccact gcaacaagca 4020 
aggaccctct ttgatgaagt gcagttcaaa gaaaaagtgt ttctgcggca acaaggcagc 4080 
tacatcctga ctgtggaggc tggcaagcaa ctccttctct cggcggacag tggcgctgag 4140 
gccgccttgc aggccgaact cgctgaaatc caagagaaat ggaaatcagc cagcatgcgg 4200 
ctggaagaac agaagaaaaa actagccttc ttgttgaaag actgggaaaa atgtgagaaa 4260 
ggaatagcag attccctgga gaaactacga actttcaaaa agaagctttc gcagtctctc 4320 
ccggatcacc atgaagagct ccatgcagaa caaatgcgtt gcaaggaatt agaaaatgca 4380 
gttgggagct ggacagatga cttgacccag ttgagcctgc tgaaggacac cctctctgcc 4440 
tatatcagtg ctgatgatat ctccattctt aatgaacgcg tagagcttct gcaaaggcag 4500 
tgggaagaac tatgccacca gctctcctta aggcggcagc aaataggtga aagattgaat 4560 
gaatgggcag tcttcagtga aaagaacaag gaactctgtg agtggttgac tcaaatggaa 4620 
agcaaagttt ctcagaatgg agacattctc attgaagaaa tgatagagaa gctcaagaag 4680 
gattatcaag aggaaattgc tattgctcaa gagaacaaaa tacagctcca acaaatggga 4740 
gaacgactt'g ctaaagccag ccatgaaagc aaagcatctg agattgaata caagctggga 4800 
aaggtcaacg accggtggca gcatctcctg gacctcattg cagccagggt gaagaagctg 4860 
aaggagaccc tggtagccgt gcagcagctt gataagaaca tgagcagcct gaggacctgg 4920 
ctcgctcaca tcgagtcaga gctggccaag ccaatagtct acgattcctg taactcggaa 4980 
gaaatacaga gaaagcttaa tgagcagcag gagcttcaga gagacataga gaagcacagt 5040 
acaggtgttg catctgtcct caacctgtgt gaagtcctgc tgcacgactg tgacgcctgt 5100 
gccactgatg ccgagtgtga ctctatacag caggctacga gaaacctgga ccggcggtgg 5160 
agaaacattt gtgctatgtc catggaaagg aggctgaaaa tcgaagagac gtggcgattg 5220 
tggcagaaat ttctggatga ctattcacgt tttgaagatt ggctgaagtc ttcagaaagg 5280 
acagctgctt ttcccagctc ttctggggtg atctatacag ttgccaagga agaactaaag 5340 
aaatttgagg ctttccagcg acaggtccac gagtgcctga cgcagctgga actgatcaac 5400 
aagcagtacc gccgcctggc cagggagaac cgcactgatt cagcatgtag cctcaaacag 5460 
atggttcacg aaggcaacca gagatgggac aacctgcaaa agcgtgtcac ctccatcttg 5520 
cgcagactca agcattttat tggccagcgt gaggagtttg agactgcgcg ggacagcatt 5580 
ctggtctggc tcacagagat ggatctgcag ctcactaata ttgaacattt ttctgagtgt 5640 
gatgttcaag ctaaaataaa gcaactcaag gccttccagc aggaaatttc actgaaccac 5700 
aataagattg agcagataat tgcccaagga gaacagctga tagaaaagag tgagcccttg 5760 
gatgcagcga tcatcgagga ggaactagat gagctccgac ggtactgcca ggaggtcttc 5820 
gggcgtgtgg aaagatacca taagaaactg atccgcctgc ctgtatacga tgagcacgac 5880 
ctctcagaca gggagctgga gctggaagac tctgcagctc tgtcggacct gcactggcac 5940 
gaccgctctg cagacagcct gctttctcca cagccttcct ccaatctctc cctctcgctc 6000 
gctcagcccc tccggagcga gcggtcagga cgagacaccc cggctagtgt ggactccatc 6060 
cccctggagt gggatcacga ctatgacctc agtcgggacc tggagtctgc aatgtccaga 6120 
gctctgccct ctgaggatga agaaggtcag gatgacaaag atttctacct ccggggagct 6180 
gttggcttat caggggacca cagtgcccta gagtcacaga tccgacaact gggcaaagcc 6240 
ctggatgata gccgttttca gatacagcaa accgaaaata tcattcgcag caaaactccc 6300 
acggggccgg agctagacac cagctacaaa ggctacatga aactgctggg cgaatgcagt 6360 
agcagtatag actccgtgaa gagactggag cacaaactga aggaggaaga ggagagcctt 6420 
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cctggctttg ttaacctgca tagtaccgaa acccaaacgg ctggtgtgat tgaccgatgg 6480 
gagcttctcc aggcccaggc attgagcaag gagttgagga tgaagcagaa cctccagaag 6540 
tggcagcagt ttaactcaga cttgaacagc atctgggcct ggctggggga cacggaggag 6600 
gagttggaac agctccagcg tctggaactc agcactgaca tccagaccat cgagctccag 6660 
atcaaaaagc tcaaggagct ccagaaagct gtggaccacc gcaaagccat catcctctcc 6720 
atcaatctct gcagccctga gttcacccag gctgacagca aggagagccg ggacctgcag 6780 
gatcgcttgt cgcagatgaa tgggcgctgg gaccgagtgt gctctctgct ggaggagtgg 6840 
cggggcctgc tgcaggatgc cctgatgcag tgccagggtt tccatgaaat gagccatggt 6900 
ttgcttctta tgctggagaa cattgacaga aggaaaaatg aaattgtccc tattgattct 6960 
aaccttgatg cagagatact tcaggaccat cacaaacagc ttatgcaaat aaagcatgag 7020 
ctgttggaat cccaactcag agtagcctct ttgcaagaca tgtcttgcca actactggtg 7080 
aatgctgaag gaacagactg tttagaagcc aaagaaaaag tccatgttat tggaaatcgg 7140 
ctcaaacttc tcttgaagga ggtcagtcgt catatcaagg aactggagaa gttattagac 7200 
gtgtcaagta gtcagcagga tttgtcttcc tggtcttctg ctgatgaact ggacacctca 7260 
gggtctgtga gtcccacatc aggaaggagc accccaaaca gacagaaaac gccacgaggc 7320 
aagtgtagtc tctcacagcc tggaccctct gtcagcagtc cacatagcag gtccacaaaa 7380 
ggtggctccg attcctccct ttctgagcca gggccaggtc ggtccggccg cggcttcctg 7440 
ttcagagtcc tccgagcagc tcttcccctt cagcttctcc tgctcctcct catcgggctt 7500 
gcctgccttg taccaatgtc agaggaagac tacagctgtg ccctctccaa caactttgcc 7560 
cggtcattcc accccatgct cagatacacg aatggccctc ctccactctg a 7611 

<210> 25 

<211> 2259 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7503554CB1 

<400> 25 

tactagcact gactgtggaa tccttaaggg cccattacat ttctgaagaa gaaagctaag 60 
atgaaggaca tgccactccg aattcatgtg ctacttggcc tagctatcac tacactagta 120 
caagctgtag ataaaaaagt ggattgtcca cggttatgta cgtgtgaaat caggccttgg 180 
tttacaccca gatccattta tatggaagca tctacagtgg attgtaatga tttaggtctt 240 
ttaactttcc cagccagatt gccagctaac acacagattc ttctcctaca gactaacaat 300 
attgcaaaaa ttgaatactc cacagacttt ccagtaaacc ttactggcct ggatttatct 360 
caaaacaatt tatcttcagt caccaatatt aatgtaaaaa agatgcctca gctcctttct 420 
gtgtacctag aggaaaacaa acttactgaa ctgcctgaaa aatgtctgtc cgaactgagc 480 
aacttacaag aactctatat taatcacaac ttgctttcta caatttcacc tggagccttt 540 
attggcctac ataatcttct tcgacttcat ctcaattcaa atagattgca gatgatcaac 600 
agtaagtggt ttgatgctct tccaaatcta gagattctga tgattgggga aaatccaatt 660 
atcagaatca aagacatgaa ctttaagcct cttatcaatc ttcgcagcct ggttatagct 720 
ggtataaacc tcacagaaat accagataac gccttggttg gactggaaaa cttagaaagc 780 
atctcttttt acgataacag gcttattaaa gtaccccatg ttgctcttca aaaagttgta 840 
aatctcaaat ttttggatct aaataaaaat cctattaata gaatacgaag gggtgatttt 900 
agcaatatgc tacacttaaa agagttgggg ataaataata tgcctgagct gatttccatc 960 
gatagtcttg ctgtggataa cctgccagat ttaagaaaaa tagaagctac taacaaccct 1020 
agattgtctt acattcaccc caatgcattt ttcagactcc ccaagctgga atcactcatg 1080 
ctgaacagca atgctctcag tgccctgtgt catccgttgg atgaacatga acaaaatcaa 1140 
cattcgattc atggagccag attcactgtt ttgcgtggac ccacctgaat tccaaggtca 1200 
gaatgttcgg caagtgcatt tcagggacat gatggaaatt tgtctccctc ttatagctcc 1260 
tgagagcttt ccttctaatc taaatgtaga agctgggagc tatgtttcct ttcactgtag 1320 
agctactgca gaaccacagc ctgaaatcta ctggataaca ccttctggtc aaaaactctt 1380 
gcctaatacc ctgacagaca agttctatgt ccattctgag ggaacactag atataaatgg 1440 
cgtaactccc aaagaagggg gtttatatac ttgtatagca actaacctag ttggcgctga 1500 



45/65 



WO 03/051902 



PCT/US02/40059 



cttgaagtct gttatgatca aagtggatgg 
gaatattaaa ataagagata ttcaggccaa 
taaaattctc aaatctagtg ttaaatggac 
tgcgcaaagt gctcgaatac catctgatgt 
atcaactgag tataaaattt gtattgatat 
atgtgtaaat gtcaccacca aaggtttgca 
taccacaaca cttatggcct gtcttggagg 
tatcagctgc ctctctccag aaatgaactg 
cttacagaaa ccaacctttg cattaggtga 
agcaggaaaa gaaaaaagta catcactgaa 
aaatatgtcc taaaaaccac caaggaaacc 
cgaaagactg cagtfcgtgct aaaaacaaaa 
aaagattact ttcgagagag aagtttaagc 

<210> 26 
<211> 1522 
<212> DNA 

<213> Homo sapiens 



atcttttcca caagataaca atggctcttt 1560 
ttcagttttg gtgtcctgga aagcaagttc 1620 
agcctttgtc aagactgaaa attctcatgc 1680 
caaggtatat aatcttactc atctgaatcc 1740 
tcccaccatc tatcagaaaa acagaaaaaa 1800 
ccctgatcaa aaagagtatg aaaagaataa 1860 
ccttctgggg attattggtg tgatatgtct 1920 
tgatggtgga cacagctatg tgaggaatta 1980 
gctttatcct cctctgataa atctctggga 2040 
agtaaaagca actgttatag gtttaccaac 2100 
tactccaaaa atgaacaaaa aaaaaaaaag 2160 
caaaacaaac aaacaaaaaa aaaagtaaaa 2220 
ttcaccaat 2259 



<220> 

<221> iuisc_f eature 

<223> Incyte ID No: 7506077CB1 



<400> 26 

gcctcagggt ccctgctgta gacggggcgg 
tcgtagggcg ctcgagatgg agcccccagc 
ctagacaagc cggagctggg accggcaatc 
cctcatccct cccgtgggag ccccctttgg 
ctgaacttga tgcgatggga ggctgtgcag 
gggaggagac cgtcccggag ccccggctcc 
agaacgcggt gggcttctgg ctgctgggcc 
tgagtgccgc ccacgacatc cttagccaca 
acccaggccc aacgccgatc ccccacaaca 
ctacggccct gtcctacctg ggcctcaccc 
tgtccatgct gggtatccct gccctgctgc 
ctgaggccca ggaccctgga ggggaagaag 
taagaaccga ggccccggag tcgaagccag 
ggtggacagt gttcaagggt ctgctgtggt 
ccgagtattt cattaaccag ggactttttg 
gtcacgctca gcaataccgc tggtaccaga 
gctcttctct ccgctgctgt cgcatccgtt 
tcaacctggt gttcctgctg gcagacgtgt 
tcttcctgat cattctgtat gaggggctcc 
acaacatcgc cctggagacc agtgatgagc 
tctctgacac actggggatc tccctgtcgg 
tctgccagct ctcctgatac tcgggatcct 
gagggacagg tcagacaccc aggcccaccc 
cttcccggca ggtctgggag tagggaaggg 
agccattgtc tcccacttgg ggagtttctt 
attttatcca tggaaaaaaa aa 



gggagagtac gatgggtggg gcgtggtggg 60 
ttccttgatg gatcgcgggg cgcgagtgcc 120 
gggcgttgat ccttgtcacc tgtcgcagac 180 
acactctatg accctggacc ctcgggggac 240 
gctcgcggcg gcgcttttcg gattccgagg 300 
ctctgttgga ccatcagggc gcgcattgga 360 
tttgcaacaa cttctcttat gtggtgatgc 420 
agaggacatc gggaaaccag agccatgtgg 480 
gctcatcacg atttgactgc aactctgtct 540 
aggccggcct ctcccctcag cagaccctgc 600 
tggccagcta tttcttgttg ctcacatctc 660 
aagcagagag cgcagcccgg cagcccctca 720 
gctccagctc cagcctctcc cttcgggaaa 780 
acattgttcc cttggtcgta gtttactttg 840 
aactcctctt tttctggaac acttccctga 900 
tgctgtacca ggctggcgtc tttgcctccc 960 
tcacctgggc cctggccctg ctgcagtgcc 1020 
ggttcggctt tctgccaagc atctacctcg 1080 
tgggaggcgc agcctacgtg aacaccttcc 1140 
accgggagtt tgcaatggcg gccacctgca 1200 
ggctcctggc tttgcctctg catgacttcc 1260 
caggacgcag gtcacattca cctgtgggca 1320 
cagagaccct ccatgaactg tgctcccagc 1380 
ctgaagcctt gtttcccttg caggggggcc 1440 
cctggcatca tgccttctga ataaatgccg 1500 

1522 



<210> 27 

<211> 1099 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc__f eature 

<223> Incyte ID No: 7506765CB1 



<400> 27 

cgcctgcttg gcggccgtcg gcgtgctctt 
gccgctgctg ctgcaggcgg cgccccgagg 
ccgcatgatc tgcgaccctt acactgccgc 
gcagccaccc ggacccagca ccgccgccct 
tcctcctcct ttcatccagg gacccaaggg 
gcgggggccc cctggagagc cgggcccgcc 
cgactcgggg cggcccgggc tgccagggct 
cggggtggtg ggcggcgggg ccggggtagg 
gctgagcgcc accttcagcg gccccaagat 
cgaaggctat gaggtgctga agttcgatga 
ccccaccacg ggcaagttca gctgccaggt 
cctcatgcgc ggcggcgacg gcaccagcat 
ccgggccagc gccattgcac aggacgccga 
ggtgctgcac ttggattcag gggacgaagt 
cggaggcaat aataacaagt acagcacgtt 
gcgcgggggg tgcgaggcgg ggtggctgca 
cttggcaaag gccactctcg attcataaca 
tccctgcgtc ctccctgccc cgctcctggc 
ggctgtgctc ctggtctcc 



ggccatggcg ctcgggctgc tcatcgccgt 60 
cgccgcgcac tatgagatga tgggcacctg 120 
gcccggcggg gagcccccgg gtgcaaaggc 180 
ggaagtcatg caggacctca gcgccaaccc 240 
cgacccgggg cgaccgggca agccagggcc 3 00 
tggacccagg ggccctccgg gagagaaggg 360 
gcaactgacg gcgggcacgg ccagcggcgt 420 
tggcgattcc gagggtgaag tgaccagtgc 480 
cgccttctat gtgggtctca agagccccca 540 
cgtggtcacc aacctcggca atcactatga 600 
gcgcggcatc tacttcttca cctaccacat 660 
gtgggcggac ctctgcaaga acgggcaggt 720 
ccagaactac gactacgcca gtaacagcgt 780 
gtatgtgaag ctggatggcg ggaaggctca 840 
ctcgggcttt cttctgtacc cggattaggg 900 
ggccgcccgg tctccgcccg ggcgcggctc 960 
cttcctgaca tctcctttgg aaaagacaaa 1020 
ctcagtgcgt ctgcgaccca ccacgctcag 1080 

1099 



<210> 28 
<211> 3311 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7490048CB1 



<400> 28 

ttagcgcaag cagctctgga gcgggcacag 
gcaggctgca cagggctctg ccggagggca 
gtggcggctg ggcgtggtga ggcggcgtct 
caaggacgca ggaggggccg acgtttccct 
gttccgaggg atcactgtgg tggagctgat 
tatctcaggt ggcaccgaca aggatggaaa 
acttgcagcc aggagtgatc tgctgaacat 
ccacctgacc aggctccgcc acgatgagat 
cgtggtgctg gaggtggagt atgagctgcc 
catttcaaag acagtggacg tctccctcta 
tagaggaggt gcccatgaag atgggcacaa 
gcccggtggc cctgccgaca gggagggctc 
cgatggaatc ccgctgcacg gggccagcca 
cagccacgag gcactctttc aggtggagta 
tgcttcggga cccttgatgg tggaaatagt 
gctcaccacc acctccctcc ggaacaagtc 
cagcgtggtg gacaggagcg gagccctgca 
caccagcatg gaacactgct cgctgcttga 
gaaggtgcgg ctggagatcc tgcctgtgcc 
ggcagtgaaa gtgcagagga gtgagcagct 
ccacagcccc cggcccggcc actgcaggat 



gagcgccggc tccaggcgtg ggaggctggg 60 
ggagggcaca atgctggcgg tgtcactcaa 120 
caaagacgat gggccctact ccaaaggagg 180 
ggcgtgccgc agacagagca ttccagagga 240 
caagaaagaa ggcagcacgc tgggcctgac 300 
gcccagggtc tccaacctga gacctggggg 360 
tggtgactat attcggtctg tgaacgggat 420 
catcaccctg ctcaagaatg tgggcgagcg 480 
cccgcccgct cctgagaata accccaggat 540 
caaggagggc aatagctttg gctttgtcct 600 
gtcccgcccg cttgtcctga cctacgtgcg 660 
cctgaaggtg ggcgacaggc tgctcagtgt 720 
tgccaccgcc ctggccaccc tgcggcagtg 780 
tgatgtggcc acccctgaca cggtggctaa 840 
caagacgcca gggtctgccc tggggatctc 900 
agtcattacc atcgaccgca tcaagccagc 960 
ccctggagac cacatcctgt ccatcgatgg 1020 
ggccaccaag ctcctggcca gcatttcaga 1080 
ccagagtcag cggccactga ggccctcaga 1140 
gcaccgctgg gacccctgcg tgccctcctg 1200 
gcccacctgg gccacacctg ctggccagga 1260 
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ccaaagccga tccttgtctt caactccctt ttcctcgccg accttgaacc acgccttttc 1320 
ctgcaacaac cccagcaccc ttccccgtgg atcccagccc atgagtcctc gaactacaat 1380 
gggccggagg aggcaccgaa gaagggaaca caagacctcg ttgtcgctag cctccagcac 1440 
ggtggggccg ggcgggcaga ttgtgcacac ggagaccacg gaggtcgtgc tctgtggaga 1500 
ccccctcagc ggctttggcc tccagctcca gggcggcatc ttcgccaccg agaccctgtc 1560 
ctccccaccc ctcgtgtgct tcatcgagcc tgacagtccg gctgagaggt gtgggctgct 1620 
gcaggtgggg gaccgtgtcc tgtccatcaa tggcattgcc accgaggacg ggactatgga 1680 
ggaagccaac cagctcctgc gggacgccgc actggcccac aaggtcgtgc tggaggtgga 1740 
gttcgatgtg gcggagtccg tcatcccaag cagtggcacc ttccacgtga agctgcccaa 1800 
gaagcgcagc gtggagctgg gcatcaccat cagctcggcc agcaggaaac gaggggagcc 1860 
cttgatcatc tccgacatca agaaaggcag cgtggcacac aggacgggca ccctggagcc 1920 
aggcgacaag ctactggcca ttgacaatat ccgcctggac aactgcccca tggaggacgc 1980 
cgtgcaaatc ctgcggcagt gcgaggacct ggtgaagctg aagatccgga aggacgagga 2040 
caactctgat gagctggaga ccacaggtgc cgtcagttac acagtggagc tgaagcgcta 2100 
cgggggtccc ctgggcatca ccatttcggg cacggaggaa ccttttgacc ccattgtcat 2160 
ctcaggcctc accaagcgtg gcctggctga gaggactggt gccatccacg tgggggaccg 2220 
cattctggcc atcaacaacg ttagcctcaa gggccggccg ctgagcgagg ccatccacct 2280 
cctgcaggtg gctggagaga ccgtcacact gaagatcaag aagcaactag accgtcccct 2340 
cctaccccgc aagtcgggca gcctcagtga gaccagtgat gctgatgagg acccagcaga 2400 
tgccctgaaa ggaggcctgc cagcagcccg cttctcgccg gctgtgccca gtgtggacag 2460 
tgctgtggag tcttgggaca gctcggccac cgagggtggc tttgggggcc cagggtccta 2520 
tacaccacag gcagcagccc ggggcacgac cccccaggag cggaggcctg gctggctgag 2580 
gggcagcccc ccacccaccg agccccggag gacgagctat accccaaccc cagctgacga 2640 
gagctttcca gaggaggagg aggaggatga ttgggagccg ccaacgagcc cagcccctgg 2700 
ccctgcccga gaggagggct tctggcgcat gtttggagaa gctctcgaag acctggagtc 2760 
atgtggtcag tcagaactgc tgagggaact ggaggcatcc atcatgacgg gcaccgtgca 2820 
gagggtggcc ctcgagggca ggcctggcca ccggccttgg cagaggggcc gggaggtacg 2880 
agcctctcct gcagaaatgg aggagctgtt gctgcctaca cccttggaga tgcacaaggt 2940 
gaccctgcac aaggacccca tgcggcatga ctttggtttc agcgtctcag atggcctcct 3000 
ggaaaaaggt gtctatgtcc acactgtgcg ccctgatggg ccagcccacc gtggaggcct 3060 
ccagcccttc gacagggtcc tgcaggtcaa ccacgtccgt acacgggact tcgactgctg 3120 
cctggcggtg ccactcctgg ccgaggcggg tgatgtcttg gagctgatca tcagccgcaa 3180 
gccgcacacg gcacacagca gccgggcccc ccgatcgcca ggccccagca gtccccggat 3240 
gctctgaagt cagcatgtgg gccgaactcc taaggggcca cccaggcaga tcctggctcc 3300 
cagacacgta g 3311 

<210> 29 
<211> 1874 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2133585CB1 

<400> 29 

gctagcacgc agtcccttgg tctcttcggt ctcctgccgc ccccgggaag cgcgctgcgc 60 

tgccgaggcg agctaagcgc ccgctcgcca tggggagccc cgcacatcgg cccgcgctgc 120 

tgctgctgct gccgcctctg ctgctgctgc tgctgctgcg cgtcccgccc agccgcagct 180 

tcccaggatc gggagactca ccactagaag acgatgaagt cgggtattca caccctagat 240 

ataaagatac cccgtggtgc tcccccatca aggtgaagta tggggatgtg tactgcaggg 300 

cccctcaagg aggatactac aaaacagccc tgggaaccag gtgcgacatt cgctgccaga 360 

agggctacga gctgcatggc tcttccctac tgatctgcca gtcaaacaaa cgatggtctg 420 

acaaggtcat ctgcagacaa aagcgatgtc ctacccttgc catgccagca aatggagggt 480 

ttaagtgtgt agatggtgcc tactttaact cccggtgtga gtattattgt tcaccaggat 540 

acacgttgaa aggggagcgg accgtcacat gtatggacaa caaggcctgg agcggccggc 600 
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cagcctcctg tgtggatatg gaacctccta gaatcaagtg cccaagtgtg aaggaacgca 660 
ttgcagaacc caacaaactg acagtccggg tgtcctggga gacacccgaa ggaagagaca 720 
cagcagatgg aattcttact gatgtcattc taaaaggcct ccccccaggc tccaactttc 780 
cagaaggaga ccacaagatc cagtacacag tctatgacag agctgagaat aagggcactt 840 
gcaaatttcg agttaaagta agagtcaaac gctgtggcaa actcaatgcc ccagagaatg 900 
gttacatgaa gtgctccagc gacggtgata attatggagc cacctgtgag ttctcctgca 960 
tcggcggcta tgagctccag ggtagccctg cccgagtatg tcaatccaac ctggcttggt 1020 
ctggcacgga gcccacctgt gcagccatga acgtcaatgt gggtgtcaga acggcagctg 1080 
cacttctgga tcagttttat gagaaaagga gactcctcat tgtgtccaca cccacagccc 1140 
gaaacctcct ttaccggctc cagctaggaa tgctgcagca agcacagtgt ggccttgatc 1200 
ttcgacacat caccgtggtg gagctggtgg gtgtgttccc gactctcatt ggcaggatag 1260 
gagcaaagat tatgcctcca gccctagcgc tgcagctcag tttcataacc cttacaaagt 1320 
tggagccatg gattttagtc tctgaataat gatgtattaa agaagcagaa ttcaacaacc 1380 
aaaaggctgt tgctgcgaat cccactctac tccttcagta tggtgctagt ggataagcat 1440 
ggcatggaca aagagcgcta tgtctccctg gtgatgcctg tggccctgtt caacctgatt 1500 
gacacttttc ccttgagaaa agaagagatg gtcctacaag ccgaaatgag ccagacctgt 1560 
aacacctgac atgatggttc ctctcttggc attcctcttc attgtctaca tagtgacatg 1620 
cacacgggaa agccttaaaa atatccttga tgtacagatt ttatttgtat attttaaaag 1680 
tctattttat tatgagcttt ctttgcactt aaaaattagc atgctgcttt ttgtacttgg 1740 
aagtgtttca taataaatta tatgagccat attattactc tttctaactt tctttactcc 1800 
atcatggctg cttgattttg ttagagaaat tagaagccca taaccataca caggctatgc 1860 
aacatgttat tcaa 1874 

<210> 30 
<211> 4211 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7509063CB1 

<400> 30 

gaaaaatgtg agaaaggaat agcagattcc ctggagaaac tacgaacttt caaaaagaag 60 
ctttcgcagt ctctcccgga tcaccatgaa gagctccatg cagaacaaat gcgttgcaag 120 
gaattagaaa atgcagttgg gagctggaca gatgacttga cccagttgag cctgctgaag 180 
gacaccctct ctgcctatat cagtgctgat gatatctcca ttcttaatga acgcgtagag 240 
cttctgcaaa ggcagtggga agaactatgc caccagctct ccttaaggcg gcagcaaata 300 
ggtgaaagat tgaatgaatg ggcagtcttc agtgaaaaga acaaggaact ctgtgagtgg 360 
ttgactcaaa tggaaagcaa agtttctcag aatggagaca ttctcattga agaaatgata 420 
gagaagctca agaaggatta tcaagaggaa attgctattg ctcaagagaa caaaatacag 480 
ctccaacaaa tgggagaacg acttgctaaa gccagccatg aaagcaaagc atctgagatt 540 
gaatacaagc tgggaaaggt caacgaccgg tggcagcatc tcctggacct cattgcagcc 600 
agggtgaaga agctgaagga gaccctggta gccgtgcagc agcttgataa gaacatgagc 660 
agcctgagga cctggctcgc tcacatcgag tcagagctgg ccaagccaat agtctacgat 720 
tcctgtaact cggaagaaat acagagaaag cttaatgagc agcaggagct tcagagagac 780 
atagagaagc acagtacagg tgttgcatct gtcctcaacc tgtgtgaagt cctgctgcac 840 
gactgtgacg cctgtgccac tgatgccgag tgtgactcta tacagcaggc tacgagaaac 900 
ctggaccggc ggtggagaaa catttgtgct atgtccatgg aaaggaggct gaaaatcgaa 960 
gagacgtggc gattgtggca gaaatttctg gatgactatt cacgttttga agattggctg 1020 
aagtcttcag aaaggacagc tgcttttccc agctcttctg gggtgatcta tacagttgcc 1080 
aaggaagaac taaagaaatt tgaggctttc cagcgacagg tccacgagtg cctgacgcag 1140 
ctggaactga tcaacaagca gtaccgccgc ctggccaggg agaaccgcac tgattcagca 1200 
tgtagcctca aacagatggt tcacgaaggc aaccagagat gggacaacct gcaaaagcgt 1260 
gtcacctcca tcttgcgcag actcaagcat tttattggcc agcgtgagga gtttgagact 1320 
gcgcgggaca gcattctggt ctggctcaca gagatggatc tgcagctcac taatattgaa 1380 
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catttttctg agtgtgatgt tcaagctaaa ataaagcaac tcaaggcctt ccagcaggaa 1440 
atttcactga accacaataa gattgagcag ataattgccc aaggagaaca gctgatagaa 1500 
aagagtgagc ccttggatgc agcgatcatc gaggaggaac tagatgagct ccgacggtac 1560 
tgccaggagg tcttcgggcg tgtggaaaga taccataaga aactgatccg cctgcctctc 1620 
ccagacgatg agcacgacct ctcagacagg gagctggagc tggaagactc tgcagctctg 1680 
tcggacctgc actggcacga ccgctctgca gacagcctgc tttctccaca gccttcctcc 1740 
aatctctccc tctcgctcgc tcagcccctc cggagcgagc ggtcaggacg agacaccccg 1800 
gctagtgtgg actccatccc cctggagtgg gatcacgact atgacctcag tcgggacctg 1860 
gagtctgcaa tgtccagagc tctgccctct gaggatgaag aaggtcagga tgacaaagat 1920 
ttctacctcc ggggagctgt tgccttatca gatgtaatga tccccgaaag ccctgaggcc 1980 
tatgtaaaac tcacagaaaa tgcaatcaaa aatacctccg gggaccacag tgccctagag 2040 
tcacagatcc gacaactggg caaagccctg gatgatagcc gctttcagat acagcaaacc 2100 
gaaaatatca ttcgcagcaa aactcccacg gggccggagc tagacaccag ctacaaaggc 2160 
tacatgaaac tgctgggcga atgcagtagc agtatagact ccgtgaagag actggagcac 2220 
aaactgaagg aggaagagga gagccttcct ggctttgtta acctgcatag taccgaaacc 2280 
caaacggctg gtgtgattga ccgatgggag cttctccagg cccaggcatt gagcaaggag 2340 
ttgaggatga agcagaacct ccagaagtgg cagcagttta actcagactt gaacagcatc 2400 
tgggcctggc tgggggacac ggaggaggag ttggaacagc tccagcgtct ggaactcagc 2460 
actgacatcc agaccatcga gctccagatc aaaaagctca aggagctcca gaaagctgtg 2520 
gaccaccgca aagccatcat cctctccatc aatctctgca gccctgagtt cacccaggct 2580 
gacagcaagg agagccggga cctgcaggat cgcttgtcgc agatgaatgg gcgctgggac 2640 
cgagtgtgct ctctgctgga ggagtggcgg ggcctgctgc aggatgccct gatgcagtgc 2700 
cagggtttcc atgaaatgag ccatggtttg cttcttatgc tggagaacat tgacagaagg 2760 
aaaaatgaaa ttgtccctat tgattctaac cttgatgcag agatacttca ggaccatcac 2820 
aaacagctta tgcaaataaa gcatgagctg ttggaatccc aactcagagt agcctctttg 2880 
caagacatgt cttgccaact actggtgaat gctgaaggaa cagactgttt agaagccaaa 2940 
gaaaaagtcc atgttattgg aaatcggctc aaacttctct tgaaggaggt cagtcgtcat 3000 
atcaaggaac tggagaagtt attagacgtg tcaagtagtc agcaggattt gtcttcctgg 3060 
tcttctgctg atgaactgga cacctcaggg tctgtgagtc ccacatcagg aaggagcacc 3120 
ccaaacagac agaaaacgcc acgaggcaag tgtagtctct cacagcctgg accctctgtc 3180 
agcagtccac atagcaggtc cacaaaaggt ggctccgatt cctccctttc tgagccaggg 3240 
ccaggtcggt ccggccgcgg cttcctgttc agagtcctcc gagcagctct tccccttcag 3300 
cttctcctgc tcctcctcat cgggcttgcc tgccttgtac caatgtcaga ggaagactac 3360 
agctgtgccc tctccaacaa ctttgcccgg tcattccacc ccatgctcag atacacgaat 3420 
ggccctcctc cactctgaac taagcagatg ccatctgcag aagtgctggt agcataagga 3480 
ggatcgggtc ataagcaatc ccaaactacc aacaagagga ccttgatctt ggcgaaagcc 3540 
ctcggtgtgg cagctttagc cctcctccag atcacatgtg tgcaaattat ggcttcagag 3600 
gtggaagata aacagtgacg ggggaacaaa cagacaacaa gaaggtttgg aagaaatctg 3660 
gtttgagact ctgaacctta gcactaagga gattgagtaa ggacctccaa agttccccgg 3720 
actcatgaat tctgggccct tggcccattc tgtgcacagc caaggacttc agtagaccat 3780 
ctgggcagct ttcccatggt gctgctccaa ccatcagata aatgaccctc ccaagcacca 3840 
tgtcagtgtc gtacaatcta ccaaccaacc agtgctgaag agattttaga accttgtaac 3900 
atacaatttt taagagctta tatggcagct tcctttttac cttgttttcc tttggggcat 3960 
gatgttttaa cctttgcttt agaagcacaa gctgtaaatc taaaaggcac ttttttttag 4020 
aggtataaag aaaaactaga tgtaataaat aagatcatgg aaggctttat gtgaaaaaag 4080 
ttgaatgtta tagtaaaaaa aaaaagatat ttatgtatgt acagtttgct aaagccaagt 4140 
tttgtttgta ttgatttctt tgcatttatt atagatatta taaaataaaa aaaaaaaaaa 4200 
aaaaaaaata t 4211 

<210> 31 

<211> 5006 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 



50/65 



WO 03/051902 



PCT/US02/40059 



<223> Incyte ID No: 5496710CB1 
<400> 31 

gaagaaaaga cggagggagg tgaggaggaa gggaggggga gagacagaga cctagagggg 60 
ctgaagaccc agacagagct ggcagagcta ctgagaagag gactggagcg ctctgagagc 120 
ctctcaagat cttttggggg agcccaataa atgtgaacat gggatctgtc acgggagctg 180 
tcctcaagac gctacttctg ttatctactc aaaattggaa cagagtcgaa gctgggaatt 240 
cctatgactg tgatgatcct cttgtgtctg ccttgcctca ggcatccttc agcagttctt 300 
ccgagctctc cagcagtcat ggtcctggat ttgcaaggct gaatagaaga gatggagctg 360 
gtggctggtc tccacttgtg tctaacaaat accagtggtt gcagattgac cttggagaga 420 
gaatggaggt caccgctgtg gccactcaag ggggatatgg tagctccaac tgggtgacca 480 
gctacctcct gatgttcagt gatagtggct ggaactggaa acaatatcgc caagaggaca 540 
gcatctgggg tttttcagga aatgcaaatg cagacagtgt tgtgtactat agactccagc 600 
cttctatcaa agccagattt ctgcgcttca tccctttgga atggaacccc aagggcagaa 660 
ttggaatgcg aatcgaagtg ttcggatgtg catacagatc agaagtggtt gatcttgatg 720 
gaaaaagttc ccttctctac agatttgatc aaaaatccct gagcccaata aaagacatta 780 
tttctttgaa attcaaaacc atgcagagtg atgggattct actccacagg gaagggccaa 840 
atggagatca catcacactg caattaagaa gagcaagact ctttttactt attaattcag 900 
gtgaagctaa actgccttcc acttccaccc tggtcaatct caccctgggc agcctgctag 960 
atgatcagca ttggcattca gtgctcatcc agcgtttggg caaacaagtc aacttcacag 1020 
tggacgaaca caggcatcat ttccatgcac ggggagaatt caatctcatg aatcttgatt 1080 
atgagatcag ctttggaggg attccagcac ctggaaaatc agtgtcattc ccacatagaa 1140 
attttcatgg atgtttagaa aatctctatt ataatggagt ggatatcatt gatttggcca 1200 
agcagcaaaa accacagatc attgctatgg gaaatgtgtc attttcttgt tcacaaccac 1260 
aatctatgcc cgtgactttt ctgagctcca ggagttattt agcactgcca gacttctctg 1320 
gagaggagga ggtttctgcc acttttcaat ttcgaacttg gaataaggca gggcttctgc 1380 
tgttcagtga acttcagctg atttcagggg gtatcctcct ctttctgagt gatggaaaac 1440 
ttaagtcgaa tctctaccag ccaggaaaat tacccagtga catcacagca ggtgtcgaat 1500 
taaatgatgg gcagtggcat tctgtctctt tatctgctaa aaagaatcac ttgagtgtgg 1560 
cggtggacgg ccagatggct tctgctgctc ctctgctggg gcctgagcag atttattcgg 1620 
gtggcaccta ttattttgga ggttgtcctg acaaaagctt tggatccaaa tgtaaaagtc 1680 
cacttggtgg atttcaggga tgtatgaggc tcatttctat cagcggcaaa gtggtagatc 1740 
tgatttcagt tcagcagggg tcccttggga acttcagtga ccttcagata gactcatgtg 1800 
gcatctcaga caggtgtttg cccaactatt gtgaacacgg tggggagtgt tcccagtcct 1860 
ggagcacctt tcattgtaac tgtaccaaca ctggttacag aggagctact tgccataact 1920 
ctatctatga gcagtcatgt gaagcctata agcacagagg aaatacttca gggttttact 1980 
atatagattc agatggaagt ggtcccctgg aaccatttct tctatattgc aatatgaccg 2040 
aaactgcatg gaccatcata cagcacaacg gctctgactt aacaagagtc agaaatacta 2100 
atccagagaa cccatatgct gggtttttcg agtatgtggc cagcatggag caacttcagg 2160 
ccactattaa ccgtgcagag cactgtgaac aggagtttac ttattactgc aagaagtcac 2220 
ggctggtcaa taagcaagat ggaacccctc tgagttggtg ggtaggaaga accaatgaaa 2280 
cgcaaaccta ctggggaggt tcttcgcctg atcttcaaaa atgtacttgt ggattagagg 2340 
gaaactgcat tgattctcag tattactgca attgtgatgc tgaccggaat gaatggacca 2400 
atgacactgg attgcttgct tataaagaac atcttccagt aactaagatc gtgattacag 2460 
acacaggccg actgcattca gaagcagctt ataaactggg gcctctgctc tgccggggag 2520 
acagatcatt ttggaattca gcttcctttg ataccgaggc ttcatatctt cattttccta 2580 
ccttccacgg agaacttagc gcggatgtat ctttcttttt taagacaaca gcttcatctg 2640 
gggtattttt agagaacttg gggattgctg attttatacg gatagagctt cgctctccga 2700 
cagtagtgac tttttcattt gatgtgggga atgggccttt tgaaatctca gtgcagtcac 2760 
ccacccactt caacgacaac cagtggcacc atgtgagggt tgaaaggaac atgaaggagg 2820 
cctcccttca agtggatcag ctgacaccaa agacacagcc cgcccccgct gatgggcatg 2880 
tcctgttaca gctcaacagt cagctcttcg tgggtggaac ggccaccaga cagagaggct 2940 
ttctgggctg cattcggtct ctgcagttga atgggatgac cctggatttg ga*agaaagag 3000 
cccaggtgac tccagaagtg cagccaggtt gtaggggaca ttgcagcagc tatgggaagt 3060 
tatgccgcaa tggagggaaa tgcagagaaa gacccattgg gttcttttgt gactgcactt 3120 
tctctgcata cacagggcca ttctgctcaa atgagatttc tgcatatttt ggatctggct 3180 
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catccgtgat atacaatttt caagaaaatt atcttttaag taaaaactcc agctcccacg 3240 
ctgcttcatt tcatggtgat atgaagctga gcagagaaat gatcaaattt agtttccgaa 3300 
caacacgaac accaagcttg ctgctttttg tgagctcctt ttacaaagaa tacctttctg 3360 
tgatcattgc caaaaatgga agtttgcaga tcaggtacaa gttaaataaa tatcaagagc 3420 
ctgatgttgt taactttgat tttaaaaaca tggctgatgg acaacttcac cacataatga 3480 
ttaacagaga agaaggagtg gtctttatag agattgacga taatagaagg agacaagttc 3540 
acctgtcatc aggcacagaa ttcagtgcag tcaaatctct ggtattgggc aggattttag 3600 
aacacagtga tgtggaccag gatactgcac tggcaggtgc gcagggcttc acaggctgcc 3660 
tctctgcagt gcagctcagc cacgtggccc ctctgaaggc agctctgcac cccagccacc 3720 
cagaccctgt cactgttaca ggacacgtga ctgagtccag ctgtatggcc cagcctggca 3780 
ctgatgccac atcaagggaa aggacacact cgtttgcaga tcattctgga acaatagatg 3840 
acagagagcc ccttgctaat gcaatcaaaa gtgactctgc agtaattgga ggtctgatag 3900 
ctgttgtgat ttttatcttg ctttgcatca ctgccatagc tgttcgcatt tatcagcaga 3960 
aaaggttata taaaagaagt gaggcaaaaa ggtcagagaa tgtagacagt gctgaggctg 4020 
ttctgaaaag tgagcttaat atacaaaatg cagtcaatga aaatcagaaa gagtacttct 4080 
tctgattggc agctatgatt taacataaaa ttatgatagt ttgttttaat agccaggagt 4140 
tctcaatgga aaaacgaatg ctcttacact gaatgtacag gcagtgggct tgcagcactg 4200 
ccatcttgcc atgtacaggc ttggggtggc tccaggaagc ctcgtccagt gatatatttc 4260 
tcatagcatt cattctatgg aacaagaaat tagatattgc tgttaatttt caactgttct 4320 
ggtatgatct aaaacaagtt taacctgctt aatggctaca gtttttacat gtgaaaactg 4380 
tagccttggt ctcttaacca tgtaatacat aagttttgtt agaggtaaaa attaaatttg 4440 
gactataatg tccttgcttt atttgagaac atttgctgtg tttgcttttg tctcctcgtg 4500 
gtgttattca tagacctgga aatgcttctc agatcctgtg tggccggtgt ttgcatcttc 4560 
agtggccaca agcataataa agcccctttg cctttctctg tattatattc aatacaatac 4620 
atcaatagtc ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4680 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagg 4740 
gggggggcgc gcaaagatag gccgccgcgc gcgagggtat ttctgcgggg gcgcacactg 4800 
ctgccggggg gggcggagag catacataca cacaactcgt gatacagcgg ggccacaaaa 4860 
gggggaggcg ggaagaaaac agcgcgcaaa aaagggggca tataaaagcg cgcgcacagg 4920 
gcgcggtgtt aaaaacgggg ggaggggaga caagcgcggg agacaccaaa tagagcgggg 4980 
gaaaccccct gattcagaaa ccggga 5006 

<210> 32 
<211> 4125 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 72150826CB1 

<400> 32 

gcgacctccc ggctgcagtc ctacctcttg gaactacccg tgtttccggg cccagccctc 60 

gcagcccccc acctcctcgc cccggcccgg ggatccgttg gggccgcgtc ccccacgcgc 120 

ccccggagac gccctttccg tgtgcgcccg ggacttggtg aaactttgca ggcgccggct 180 

gcgaaatgga tttaatccga ggcgtcttgc tccggctcct gctcctggct tccagcctcg 240 

gacccggcgc ggtgtcgctc cgagcggcca tccgaaaacc aggcaaggta gggcctcctc 300 

ttgacatcaa gctgggcgca ttgaactgta cggctttcag catccagtgg aaaatgccaa 360 

ggcatcctgg aagtcccatc cttgggtaca ctgtctttta ctctgaggtt ggcgcagata 420 

aatccctgca ggagcagttg cacagcgtgc ctctcagccg ggacatcccg accacggagg 480 

aagtgattgg agatttgaaa ccaggcactg aatatcatgt gagcatagca gcttacagcc 540 

aggctggcaa agggcggctg agctctcctc ggcatgtcac cactttgtcc caagattcct 600 

gcctgcctcc tgcagctccc cagcagccac atgtcattgt ggtttcggat tctgaggtgg 660 

ccctgtcttg gaaacctgga gcgagtgaag gaagcgcccc tattcagtac tattctgtgg 720 

aattcatcag gccagatttc gacaagaagt ggacctcaat ccatgagcgg atccagatgg 780 

actccatggt tatcaagggc ctcgatccag ataccaacta ccagtttgcc gtgagggcaa 840 
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tgaattccca tggccccagc ccccgcagct ggcccagtga catcatccgg accctctgcc 900 
ctgaggaggc gggaagtggc cgctatggac cccgttatat caccgacatg ggagctggtg 960 
aggatgatga aggatttgaa gacgacttag atttggatat ttcctttgag gaggttaaac 1020 
cacttcctgc taccaaagga gggaataaga aatttttggt ggaaagcaag aagatgtcta 1080 
tatctaaccc aaagaccatt tctaggctca tcccccctac ctcagcatct ctccctgtga 1140 
ccacggtggc tccccagccc attcccatac agagaaaggg gaagaatggt gtggccataa 1200 
tgtcaaggct ctttgacatg ccttgtgatg aaactctctg ctctgctgac agcttctgtg 1260 
tcaatgacta cacctggggg ggctcgcgat gccagtgcac cctgggcaaa ggtggtgaga 1320 
gctgctcaga agatattgtt atccagtatc ctcagttctt tggccactcc tatgtaacgt 1380 
ttgaacctct gaagaattct tatcaggcat ttcaaattac tcttgaattt agggcggagg 1440 
cagaggatgg cttactgctc tactgtgggg agaacgaaca cgggaggggg gatttcatgt 1500 
ccctggctat catccgacgc tccctgcagt tcaggtttaa ttgtggaact ggggtfcgcca 1560 
tcatcgtaag tgagaccaaa atcaaactag ggggttggca cacggttatg ctctacagag 1620 
atgggctgaa cgggctgctg cagctgaaca atggcacccc agtgacaggc cagtctcagg 1680 
gccaatacag taaaattact ttccggacac ctctctatct tggtggcgct cccagcgctt 1740 
actggttggt tagagcaaca gggacaaacc gaggctttca aggctgtgtg cagtcgctcg 1800 
ctgtgaatgg gaggagaatt gacatgaggc cctggcccct gggaaaagca ctcagtgggg 1860 
ctgatgtggg ggaatgcagc agtggaatct gtgatgaggc ctcgtgcatc catggtggca 1920 
cctgcacagc aatcaaagcc gactcctaca tttgcctctg tccccttggg tttaaaggtc 1980 
gacactgtga agatgctttc accttgacca ttcctcagtt cagagagtct ctgagatctt 2040 
acgctgcaac tccctggcca ctggagcccc agcattacct ttccttcatg gaatttgaga 2100 
tcacatttcg gccagactca ggagatggtg tcctcctgta cagctatgac acaggcagca 2160 
aagacttcct gtccatcaac ttggcagggg gccacgtgga gttccgcttt gactgtggct 2220 
ctgggaccgg tgtcctcagg agtgaagatc ccctcaccct gggcaactgg cacgagcttc 2280 
gtgtatctcg cacagcaaag aatggaatct tacaggtgga taagcagaag atagtggagg 2340 
gaatggcaga gggaggcttc acacagatta agtgcaacac agacattttc attggcggag 2400 
tccccaatta tgatgatgtg aagaagaact cgggtgtcct gaagcctttc agcgggagca 2460 
tccagaagat catcctgaat gaccgaacca tccatgtgaa gcatgacttc acctccggag 2520 
tgaatgtgga gaatgcggcc cacccctgtg tgagagcccc ttgtgcccat gggggcagct 2580 
gccggcccag gaaggagggc tatgactgtg actgcccctt gggctttgag gggcttcact 2640 
gccagaaagc gatcatagaa gccattgaga tcccgcagtt tatcggccgc agttacctga 2700 
cgtatgacaa cccagatatc ttgaagaggg tgtcaggatc aagatcaaat gtgttcatga 2760 
ggtttaaaac aactgccaag gatggccttt tgctgtggag gggagacagc cccatgagac 2820 
ccaacagcga cttcatttcc ttgggccttc gggatggagc cctcgtgttc agctataacc 2880 
tgggcagtgg tgtggcatcc atcatggtga atggctcctt caacgatggt cggtggcacc 2940 
gagttaaggc cgttagggat ggccagtcag gaaagataac cgtggatgac tatggagcca 3000 
gaacaggcaa atccccaggc atgatgcggc agcttaacat caatggagct ctgtatgtgg 3060 
gtggaatgaa ggaaattgct ctgcacacta acaggcaata tatgagaggg ctcgtgggct 3120 
gtatctctca cttcaccctg tccaccgatt accacatttc cctcgtggaa gatgccgtgg 3180 
atgggaaaaa catcaacact tgtggagcca agtaacacca gctggccttg tccaagggac 3240 
agagccttct attctgagaa tcccaggggc cctcagaccc tgcctgatgc tatatgcaga 3300 
ggcccaggga ccaggtgtgt ttcctctcac caagaagaaa gtacacactg atgagaaact 3360 
gagaaccaag acaggcatcc ctgggtggcc tttcctgctg acactccacg agctgaccca 3420 
gcagaattct ctgtgtagga agcatcggac tttgtccatt gaatatgtag cggctgccag 3480 
agatcacaca tcaatgcaaa ttccagagcc tgtctgctat agctcagtga ctgtgttgtg 3540 
attcatagta cattaaaaag agagagagag agaaagaatc ccacagggca ctattaaaat 3600 
acttctctcc ttccctgact catgacactc ttcctgacag cagaatgact gtgtgacctt 3660 
gaacttcaca tttcccacat tggcccttgg attgttcgga ttaacccctt ccactcctca 3720 
ctggctggtt cactgtgttc tgactagtcc ataaaaataa agatggaagg agatcaaaca 3780 
cagaatcata atcataatgc ggcccccttc cccagagctt acctcgaatg tgaaatccct 3840 
acttcttata atgcagagtt aaggagctga gcccccatga ttttgtagct gtacttttgt 3900 
atgtgtatat ttttatagct gcagattgtc cacacagtat acttttggaa gtttgggtgg 3960 
atgactctga attcttgcac acctttccta aaaatttctc ccaagagaga cttgtttggg 4020 
ccttttgtgg tgtcagtgga atggataggt aacttgggtg ggcaacttgg ggagctgacc 4080 
attctctttt ggtaaatgtt gacggctcaa gaggatgaat atgaa 4125 
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<210> 33 
<211> 3597 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 6799476CB1 

<400> 33 

cgccgcgtct gagcgcgcgg cccagcgtca 
cccccggagc tccccgcgcc gcggagcagc 
ccgagagctc cgctcggcgt cccgcgcgcc 
gctgcgccgc cccgctcccg cgctggcccc 
gtgcggcggc ggggtctggg ccgcgcgagt 
ctaccacggc atagtcacag agaacgacaa 
gctggataaa gatgcgcctc tgcgatttgc 
aggtgagatt tgtggattta aaattcacgg 
ggataaatcc actggtgagg gagtcattcg 
gaaagactat tcattcacca tccaggccta 
cgtgaaaaag tctcataaag caactgttca 
gcccgtgttc aaggagaagt cctacaaagc 
cattttgagg gtggaggccg tggatgccga 
ctacgaaatc atcactccag acgtgccctt 
cacagagaaa ttaaactacg ggaaagaaca 
ctgtgggaag aaaagagcca cagaagatgt 
cacccctggg tggcaaggat ggaacaacag 
ggccgtcttt ccaaatatcc acctggagac 
cacagtggag ctagaaacca gccacatagg 
gaagtccctc caccggctct gtggtgcggc 
gagtggatcc ctcaactgga ccatgggcct 
ggtgtttgag ttcaacggca cccaggcagt 
ccccaaagag ccgttcacca tctcggtgtg 
ggagacaatt ctttgcagtt ctgataaaac 
tgtccacggg tgccggctga tcttcctctt 
cagacctgca gagttccact ggaagttgaa 
cgtcctcaat gtagaattcc cgagtgtgac 
cttctctgtg actgaggatt acccgctcca 
gggggcttgc tggcaaggtg gcgacctgca 
tggcttaact ctccgttccg ggaaactcgc 
ctgcaaggag gggctggacc tgcaggtcct 
agcacacccc agccagttgg tattgacctt 
ggccatgcag cacatctcgt acctgaactc 
actcaaaatc accagcacaa tcaagtgttt 
ggtagatggc tacgtgatgg ttttacagcc 
ccaccatttt gcccgagcag cttctgaatt 
tgagcttcgc atcatcagca ccatcacgag 
ggaccccaca gttcaagaat cactggtgtc 
tgaggtcacg gtggagggag aggagctgaa 
ggcccgcctg cagcagaagg gcattgaagt 
aggcgtggac accatggcca gctacgagga 
gcatgccagg tccttgcttg accggaagtt 
ctacatcagc aacgaattta aggtggaggt 
acacgccaac cacatggctg cccagccaca 
tgacctgtca ggccacaacc tggccaaccc 
gacagttgtg atcgtggtgt gcgtcagctt 



cggcggcggc ggcggcggct cctccttgga 60 
tggccccagg cccctagagc cccgagagct 120 
tccctgccgc tcccgccccg ggctggcgat 180 
ggccgcccgg ctgctgctgg ccgggctgct 240 
taacaagcac aagccctggc tggagcccac 300 
caccgtgctc ctcgaccccc cactgatcgc 360 
agagagtttt gaggtgacag tcaccaaaga 420 
gcagaatgtc ccctttgatg cagtggtagt 480 
ctccaaagag aaactggact gtgagctgca 540 
tgattgtggg aagggacctg atggcaccaa 600 
tattcaggtg aacgacgtga atgagtacgc 660 
cacggtcatc gaggggaagc agtacgacag 720 
ctgctcccct cagttcagcc agatttgcag 780 
tactgttgac aaagatggtt atataaaaaa 840 
tcaatataag ctgaccgtca ctgcctatga 900 
tttggtgaag atcagcatta agcccacctg 960 
gattgagtat gagccgggca ccggcgcgtt 1020 
atgtgacgag ccagtcgcct cagtacaggc 1080 
gaaaggctgc gaccgagaca cctactcaga 1140 
cgcgggcact gccgagctgc tgccatcccc 1200 
gcccaccgac aatggccacg acagcgacca 1260 
gaggatcccg gatggcgtcg tgtcggtcag 1320 
gatgagacat gggccattcg gcaggaagaa 1380 
agatatgaat . cggcaccact actccctcta 1440 
ccgtcaggat ccttctgagg agaagaaata 1500 
tcaggtctgt gatgaggaat ggcaccacta 1560 
tctctatgtg gatggcacgt cccacgagcc 1620 
tccatccaag atagaaactc agctcgfcggt 1680 
catgacccag tttttccgag gcaatctggc 1740 
ggataagaag gtgatcgact gtctgtatac 1800 
cgaagacagt ggcagaggcg tgcagatcca 1860 
ggagggagaa gacctcgggg aattggataa 1920 
ccggcagttc cccacgcccg gaattcgcag 1980 
taacgaggcc acctgcattt cggtcccccc 2040 
cgaggagccc aagatcagcc tgagtggcgt 2100 
tgaaagctca gaaggggtgt tccttttccc 2160 
agaagtggag cctgaagggg acggggctga 2220 
cgaggagatc gtgcacgacc tggatacctg 2280 
ccacgagcag gagagcctgg aggtggacat 2340 
gagcagctct gaactgggca tgaccttcac 2400 
ggttttgcac ctgctgcgct atcggaactg 2460 
taagctcatc tgctcagagc tgaatggccg 2520 
gaatgtaatc cacacggcca accccatgga 2580 
gttcgtgcac ccggaacacc gctcctttgt 2640 
ccacccgttc gcagtcgtcc ccagcactgc 2700 
cctggtgttc atgattatcc tgggggtatt 2760 
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tcggatccgg gccgcacatc ggcggaccat gcgggatcag gacaccggga aggagaacga 2820 
gatggactgg gacgactctg ccctgaccat caccgtcaac cccatggaga cctatgagga 2880 
ccagcacagc agtgaggagg aggaggaaga ggaagaggaa gaggaaagcg aggacggcga 2940 
agaagaggat gacatcacca gcgccgagtc ggagagcagc gaggtggagg agggggagca 3000 
gggcgacccc cagaacgcaa cccggcagca gcagctggag tgggatgact ccaccctcag 3060 
ctactgaccc gtgcccccgg ccacctcggt ttctgctttc gaagactctg ctgccatccg 3120 
ttctcccagt cccaagggtc cacgatgtac aaagtcattt cggccagtag gtgtgcagac 3180 
ccctccccgc cacgatcgtc gctgtgcttg gtgtgtagga ccctaggctc cccgcccacc 3240 
ctctgcctgg tcgcgctctt cagtcccacg aggagctgac acgtcctctc tggccgccat 3300 
ccggctcgca caggggcctc ccagcgcctc aggccccgcg tttgtgtctg gagtctcccc 3360 
ccggggagag gacactggcc cctcgcactc cagaaaagcc atgccagctg ggctcgttga 3420 
caaagggtaa aacatgctca ctcccacccg gtaatcattt ttttcttttt ttaaaaaaag 3480 
tttttatttt ttccaaacta gtgcatgtat aaataatggc aggatggggg gtactgtgta 3540 
gatgattaac tgacttttta atattttgta aataaatcgg attccttgtg ccaaaaa 3597 

<210> 34 
<211> 3569 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7509488CB1 

<400> 34 

gcgcgtctga gcgcgcggcc agcgtcacgg cggcggcggc ggcggctcct ccttggaccc 60 
ccggagctcc ccgcgccgcg gagcagctgg ccccaggccc ctagagcccc ggagagctcc 120 
gagagctccg ctcggcgtcc cgcgcgcctc cctgcctgct cccgccccgg gttggcgatg 180 
ctgcgccgcc ccgctccccg cgctgtgccc ccggccgccc ggctgctgct ggccgggctg 240 
ctgtgcggcg gcggggtctg ggccgcgcga gttaacaagc acaagccctg gctggagccc 300 
acctaccacg gcatagtcac agagaacgac aacaccgtgc tcctcgaccc cccactgatc 360 
gcgctggata aagatgcgcc tctgcgattt gcaggtgaga tttgtggatt taaaattcac 420 
gggcagaatg tcccctttga tgcagtggta gtggataaat ccactggtga gggagtcatt 480 
cgctccaaag agaaactgga ctgtgagctg cagaaagact attcattcac catccaggcc 540 
tatgattgtg ggaagggacc tgatggcacc aacgtgaaaa agtctcataa agcaactgtt 600 
catattcagg tgaacgacgt gaatgagtac gcgcccgtgt tcaaggagaa gtcctacaaa 660 
gccacggtca tcgaggggaa gcagtacgac agcattttga gggtggaggc cgtggatgcc 720 
gactgctccc ctcagttcag ccagatttgc agctacgaaa tcatcactcc agacgtgccc 780 
tttactgttg acaaagatgg ttatataaaa aacacagaga aattaaacta cgggaaagaa 840 
catcaatata agctgaccgt cactgcctat gactgtggga agaaaagagc cacagaagat 900 
gttttggtga agatcagcat taagcccacc tgcacccctg ggtggcaagg atggaacaac 960 
aggattgagt atgagccggg caccggcgcg ttggccgtct ttccaaatat ccacctggag 1020 
acatgtgacg agccagtcgc ctcagtacag gccacagtgg agctagaaac cagccacata 1080 
gggaaaggct gcgaccgaga cacctactca gagaagtccc tccaccggct ctgtggtgcg 1140 
gccgcgggca ctgccgagct gctgccatcc ccgagtggat ccctcaactg gaccatgggc 1200 
ctgcccaccg acaatggcca cgacagcgac caggtgtttg agttcaacgg cacccaggca 1260 
gtgaggatcc cggatggcgt cgtgtcggtc agccccaaag agccgttcac catctcggtg 1320 
tggatgagac atgggccatt cggcaggaag aaggagacaa ttctttgcag ttctgataaa 1380 
acagatatga atcggcacca ctactccctc tatgtccacg ggtgccggct gatcttcctc 1440 
ttccgtcagg atccttctga ggagaagaaa tacagacctg cagagttcca ctggaagttg 1500 
aatcaggtct gtgatgagga atggcaccac tacgtcctca atgtagaatt cccgagtgtg 1560 
actctctatg tggatggcac gtcccacgag cccttctctg tgactgagga ttacccgctc 1620 
catccatcca agatagaaac tcagctcgtg gtgggggctt gctggcaagg tggcgacctg 1680 
cacatgaccc agtttttccg aggcaatctg gctggcttaa ctctccgttc cgggaaactc 1740 
gcggataaga aggtgatcga ctgtctgtat acctgcaagg aggggctgga cctgcaggtc 1800 
ctcgaagaca gtggcagagg cgtgcagatc caagcacacc ccagccagtt ggtattgacc 1860 
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ttggagggag aagacctcgg ggaattggat aaggccatgc agcacatctc gtacctgaac 1920 
tcccggcagt tccccacgcc cggaattcgc agactcaaaa tcaccagcac aatcaagtgt 1980 
tttaacgagg ccacctgcat ttcggtcccc ccggtagatg gctacgtgat ggttttacag 2040 
cccgaggagc ccaagatcag cctgagtggc gtccaccatt ttgcccgagc agcttctgaa 2100 
tttgaaagct cagaaggggt gttccttttc cctgagcttc gcatcatcag caccatcacg 2160 
agagaagtgg agcctgaagg ggacggggct gaggacccca cagttcaaga atcactggtg 2220 
tccgaggaga tcgtgcacga cctggatacc tgtgaggtca cggtggaggg agaggagctg 2280 
aaccacgagc aggagagcct ggaggtggac atggcccgcc tgcagcagaa gggcattgaa 2340 
gtgagcagct ctgaactggg catgaccttc acaggcgtgg acaccatggc cagctacgag 2400 
gaggttttgc acctgctgcg ctatcggaac tggcatgcca ggtccttgct tgaccggaag 2460 
tttaagctca tctgctcaga gctgaatggc cgctacatca gcaacgaatt taaggtggag 2520 
gtgaatgtaa tccacacggc caaccccatg gaacacgcca accacatggc tgcccagcca 2580 
cagttcgtgc acccggaaca ccgctccttt gttgacctgt caggccacaa cctggccaac 2640 
ccccacccgt tcgcagtcgt ccccagcact gcgacagttg tgatcgtggt gtgcgtcagc 2700 
ttcctggtgt tcatgattat cctgggggta tttcggatcc gggccgcaca tcggcggacc 2760 
atgcgggatc aggacaccgg gaaggagaac gagatggact gggacgactc tgccctgacc 2820 
atcaccgtca accccatgga gacctatgag gaccagcaca gcagtgagga ggaggaggaa 2880 
gaggaagagg aagaggaaag cgaggacggc gaagaagagg atgacatcac cagcgccgag 2940 
tcggagagca gcgaggtgga ggagggggag cagggcgacc cccagaacgc aacccggcag 3000 
cagcagctgg agtgggatga ctccaccctc agctactgac ccgtgccccc ggccacctcg 3060 
gtttctgctt tcgaagactc tgctgccatc cgttctccca gtcccaaggg tccacgatgt 3120 
acaaagtcat ttcggccagt aggtgtgcag acccctcccc gccacgatcg tcgctgtgct 3180 
tggtgtgtag gaccctaggc tccccgccca ccctctgcct ggtcgcgctc ttcagtccca 3240 
cgaggagctg acacgtcctc tctggccgcc atccggctcg cacaggggcc tcccagcgcc 3300 
tcaggccccg cgtttgtgtc tggagtctcc ccccggggag aggacactgg cccctcgcac 3360 
tccagaaaag ccatgccagc tgggctcgtt gacaaagggt aaaacatgct cactcccacc 3420 
cggtaatcat ttttttcttt ttttaaaaaa agtttttatt ttttccaaac tagtgcatgt 3480 
ataaataatg gcaggatggg gggtactgtg tagatgatta actgactttt taatattttg 3540 
taaataaatc ggattccttg tgccaaaaa 3569 

<210> 35 
<211> 2431 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc — feature 

<223> Incyte ID No: 7510060CB1 

<400> 35 

ttcggctcga ggcgagctgg gctggcggag cagaggtgca gaagcaactg agtccaagtt 60 
gtctggcggc ttcaggtgga cccagaagac gtccccaact cagggagatt cagcgatcac 120 
tcactcgctg tacagaatga tattcctcac ggcactgcct ctgttctgga ttatgatttc 180 
agcctcccga gggggtcact ggggtgcctg gatgccctcg tccatctcgg ccttcgaagg 240 
cacgtgcgtc tccatcccct gccgctttga cttcccggat gagcfcgcggc ccgctgtggt 300 
gcatggtgtc tggtacttca atagccccta ccccaagaac taccccccgg tggtcttcaa 360 
gtcgcgcacc caagtagtcc acgagagctt ccagggccgc agccgcctcc tgggggacct 420 
gggcctgcga aactgcaccc tcctgctcag caacgtcagc cccgagctgg gcgggaagta 480 
ctacttccgt ggggacctgg gcggctacaa ccagtacacc ttctcagagc acagtgtcct 540 
ggatatcgtc aacaccccca acatcgtggt gcccccagag gtggtggcag gcacggaggt 600 
ggaggtcagc tgcatggtgc cggacaactg cccagagctg cgccctgagc tgagctggct 660 
gggccacgag gggctggggg agcccgctgt gctgggccgg ctgcgggagg acgagggcac 720 
ctgggtgcag gtgtcactgc tgcacttcgt gcccacgagg gaggccaacg gccacaggct 780 
gggctgccag gcctccttcc ccaacaccac cctgcagttc gagggctacg ccagcatgga 840 
cgtcaagtac cccccggtga ttgtggagat gaactcctcg gtggaggcca tcgagggctc 900 
ccacgtgagc ctgctctgtg gggctgacag caaccccccg ccgctgctga cctggatgcg 960 
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ggacgggaca gtcctccggg aggcggtggc cgagagcctg ctcctggagc tggaggaggt 1020 
gacccccgcc gaagacggcg tctatgcctg cctggccgag aatgcctatg gccaggacaa 1080 
ccgcaccgtg gggctcagtg tcatgtatgc accctggaag ccaacagtga acgggacaat 1140 
ggtggccgta gagggggaga cggtctctat cttgtgctcc acacagagca acccggaccc 1200 
tattctcacc atcttcaagg agaagcagat cctgtccacg gtcatctacg agagcgagct 1260 
gcagctggag ctgccggccg tgtcacccga ggatgatgga gagtactggt gtgtggctga 1320 
gaaccagtat ggccagaggg ccactgcctt caacctgtct gtggagttcg cccctgtgct 1380 
cctcctggag tcccactgcg cggcagcccg agacacggtg cagtgcctgt gcgtggtgaa 1440 
gtccaacccg gagccgtccg tggcctttga gctgccatcg cgcaatgtga ccgtgaacga 1500 
gagcgagcgg gagttcgtgt actcggagcg cagcggcctc gtgctcacca gcatcctcac 1560 
gctgcggggg caggcccagg ccccgccccg cgtcatctgc accgcgagga acctctatgg 1620 
cgccaagagc ctggagctgc ccttccaggg agcccatcga ctgatgtggg ccaagatcgg 1680 
gcctgtgggc gccgtggtcg cctttgccat cctgattgcc atcgtctgct acattaccca 1740 
gacacgcagg aaaaagaacg tgacagagag ccccagcttc tcggcagggg acaaccctcc 1800 
cgtcctgttc agcagcgact tccgcatctc tggggcacca gagaagtacg agtccaaaga 1860 
ggtttctacc ctggaatctc actgagtgcc ccaggagagc gagaggcgcc tgggatctga 1920 
gaggaggctg ctgggccttc ggggtgagcc cccagagctg gacctgagct attctcactc 1980 
ggacctgggg aaacggccca ccaaggacag ctacacgctg acggaggagc tagctgagta 2040 
tgctgaaatc cgggtcaagt . gaaggagctg ggggcagcct gcgtggctga cccccctcag 2100 
gaccctcgct ggcccccact ggctgtgggc tcccttcctc ccaaaagtat cgggggctgg 2160 
ggcaggaggg gagtgaggca ggtgacagtg aggtcctggg ggcctgacct ccccctcctt 2220 
cccagctgcc cctccctgcc agcaccccca cgccctcatt acggctcctc tctaacctcc 2280 
tttaccctca tctgtctgga ggggagctct gtctgtccgt gttatttatt gctacttcct 2340 
gcctggtctc ctgcccccac acctggccct ggggcctgta caaaagggac atgaaataaa 2400 
tgccccaaag ccaaaaaaaa aaaaaagggg g 2431 

<210> 36 
<211> 3842 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7510226CB1 

<400> 36 

ggaaggctgc tccccaccac ctcgccccca ccgctgggga agggttacca tgacaaccaa 60 
cctggccggc aatcagagag gatggggcca ctgctggtcg ccctttctct cagagtaaaa 120 
gtaattgtcc aggagagtgg tgtttgggtg tggaaggggt aaggctcttt ctctggggcg 180 
aagtgggagg aggggggcag aaagcggccc cttcctcagt ctcactccag cgagggggac 240 
cgtcgcgcgg ggcctgggtc tccctcgccc tcctcggggc gcgatctccc acccgccggc 300 
ccccgcgcgg gcacgctacc cttcggtgta gacccgggtg ggcgaggctc gcgctcgccg 360 
ccggcccccc ttcctactgt cccccgccgg gcgcgcgcgc ggtggttgcg gcgggggagg 420 
gggggggtgt tggcgcggcc gcgcaggcgc gcgaggcaca gcgggcgcgc gggccggccc 480 
ggggcctcca tgatggagga gcgagcggcc gccgcggtcg ccgccgccgc ctcctcctgc 540 
cgtccgctcg gctcaggcgc gggccctggc cccactgggg cggccccggt ctccgcccct 600 
gcccccgggc cgggcccggc aggfcaaggga ggcggcggcg gaggcagccc cgggcccacg 660 
gcgggcccgg agcccctgag cctgccgggg atcctgcact ttatccagca cgagtgggcg 720 
cgcttcgaag ccgagaaagc ccgctgggag gccgagcgcg ccgagttaca ggctcaggtg 780 
gccttccttc agggagagag gaaagggcag gagaatctaa agacggacct ggtgcggcgg 840 
atcaagatgc tagagtatgc gctgaagcag gaaagggcca aatatcataa actgaagttt 900 
gggacagacc tgaaccaggg ggagaagaaa gcagatgtgt cagaacaagt ctccaatggc 960 
cccgtggaat cggtcaccct ggagaacagc ccgttggtgt ggaaggaggg gcggcagctt 1020 
ctccgacatc cagtcccagc tggcttcagt ttttaaattg cagaagtaat aagtgctcat 1080 
ttgtagaagt acctggaaga ggcgggctac acagacacca tcctcgacat gcggtccaag 1140 
cgcgtccgtt ccctgctggg ccgctcgctg gagctcaacg gggcagtgga gccgagtgaa 1200 
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ggggccccca gggctccacc aggccctgca gggctcagtg gtggggagtc gctgctggtg 1260 
aaacagatcg aggagcagat aaagaggaac gcggcaggca aagatggcaa agagcgcttg 1320 
ggcggctcag tgctggggca gatccccttc ctgcagaact gcgaggacga agacagcgac 1380 
gaggacgatg agctggacag cgtgcagcac aagaagcagc gtgtgaagct cccatccaag 1440 
gctctggtgc ccgaaatgga agacgaggat gaggaagacg actctgagga tgctatcaat 1500 
gagtttgatt tcctgggctc aggagaggat ggggaagggg ctccagaccc tcggcggtgc 1560 
actgtggatg ggagccccca tgagctggaa agccgtcggg tcaaactcca aggcattctg 1620 
gctgacctgc gggatgtgga tgggctgccc ccaaaagtga ctggcccgcc tcctggcaca 1680 
ccccagcccc ggccacatga agacgtcttc atcatggaca ctatcggggg cggggaggtg 1740 
agcctggggg acttggcaga tctcaccgtc accaacgaca acgacctcag ctgcgatctg 1800 
tctgacagca aagatgcttt taagaagacg tggaacccca agttcaccct gcgctcgcac 1860 
tacgacggca ttcgttccct ggccttccac cacagccagt cggctctgct caccgcctcc 1920 
gaggacggca cgctcaagct ctggaacctg cagaaggcgg tcacggccaa gaagaatgcg 1980 
gcgctagatg tggaacctat acatgctttc cgggctcaca ggggcccagt gttggctgtg 2040 
gctatgggca gcaacagtga atactgctac agtggcgggg cagatgcctg catccatagt 2100 
tggaagattc cagacctcag catggatccc tatgatggct acgacccaag cgtgctgagc 2160 
cacgtcctgg agggccacgg ggacgccgtg tggggcctgg ccttcagtcc cacctcccag 2220 
cgcctggcct cctgttctgc tgatggcacc gtccgcatct gggaccccag cagcagcagc 2280 
ccggcctgcc tctgcacctt ccccacagcc agcgagcacg gggtccccac ctcagtggcc 2340 
ttcaccagca ccgagcctgc ccacatcgtg gcctccttcc gctctggcga caccgtcttg 2400 
tatgacatgg aggttggcag tgccctcctc acgctggagt cccggggcag cagcggtcca 2460 
acccagatca accaagtggt gagtcatcca aaccagcctc tcaccatcac cgcccacgac 2520 
gacaggggca tccgcttcct ggacaatcgg acaggtaagc cggtgcactc catggttgca 2580 
cacctggacg cagtcacctg cctagccgtg gaccccaacg gcgcattcct gatgtcagga 2640 
agccatgact gctccctgcg tctctggagc ctggacaaca aaacgtgcgt gcaggagatc 2700 
acggcccacc gcaagaagca tgaggaggcc atccacgctg ttgcctgcca ccccagcaag 27 60 
gccctcattg ccagtgctgg cgctgatgcc ctggccaagg tcttcgtatg atgcccacct 2820 
ggccctgccc tggccgccac gctggctggg gtgtagggcc gggcaggtgg ggctgagggg 2880 
actcccagcc tcgctggagg tgcagaggcc tggcctctgt ctgtttgccg tggagcctgg 2940 
tggtgctaac ggccctttcc ccaggaatcc ctccccaacg gctgccttcc ccaagcatcc 3000 
agcctgggga tgggcccctt tcaggaatca cccccgagga tggcctcggg gtttccccct 3060 
gtgccctgac agccgctcct ggtatcagag ctgctttcca gagggcagca cagccagggc 3120 
cgggtgtctt tagggatcag tttttccaag actccagaag gtgccaggtt ctccccttga 3180 
gctcctgctc ccccatcccc catcctccca gacctgggca ggagctgtgc tgggagaaag 3240 
ggtggcagcc gggagcctca gagctgagcc ccggcctggc tgcccctccc ctccggccct 3300 
ctccactgct ttccagacac taaccaaggc ttccaggcca gggattgccc aacactccta 3360 
gggcagccct cccagcgccc catggggtcg cccatgggta gagacggctt tcttgtgccc 3420 
ctccctgggg catggagggt ggagtgggcc tcgggtcccc ctgaactccc tgtatatctg 3480 
tataaataac gggattttca tggcgccgcc ccacccgcat tatcactgtg tgatggtctc 3540 
agtcagtctc ctccctgtct ccactctttc cctctattta tttcactctc ttgtttggtt 3600 
ctaccctgca ccctcggtcc ccttccaggt tcctgtttat aagccccaac ccctctgtcc 3660 
ccatcttgta tgtgaaaact tgtctcaata aaccctttgg agtaaaaaaa aaaaaaaaaa 3720 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3780 
aaaaaaaaaa caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ggcagcgcca 3840 
ca 3842 

<210> 37 

<211> 2892 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7510385CB1 

<400> 37 
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PCT/US02/40059 



gggcagctca aggctcctcc acacacacac 
agatggggcg ggccggggct gccgccgtga 
ggctggggag tgccgccccc agccccccac 
cctggcatgg tctccagact ttcagcctgg 
tggatgagga gcgtggacgc ctgtttgtgg 
tggacaacat cagcaagcgg gccaagaagc 
aggagtgcaa ctgggcaggg aaggacattg 
tgcatgccta caaccgcacc catttgctgg 
gtgcctttgt ggaagtgggc caccgggcag 
ggatagagga tggcaagggg aagagtcctt 
tggtggggga ggagctatac tcaggggtgg 
tctttcgcag cctagggcaa cgtccaagtc 
tcaatgagcc caagtttgtc aaggtatttt 
acaaaatcta cttcttcttt cgtgagacgg 
tgtccgtgtc ccgcgttggc cagatctgcc 
tcaacaagtg gacgacgttc ctgaaggcgc 
gcgacaccca cttcgatcag ctccgtgagt 
gcagcggcgc agactccggg agcccccgcc 
ggccccttcc ccgcagagga tgtgtttctg 
ctctatgccg tcttctccac gtccagcagc 
agcatgaacg acgtgcgccg ggccttcttg 
caccagtggg tgtcatacca gggtcgcgtc 
aagacctttg gcaccttcag ttccaccaag 
cggaaccacc ccctcatgta caactctgtc 
caagttggag ccaattacac cttcactcaa 
ggacactatg acgtcctctt cattggcaca 
gtccccaagg gcagtaggcc cagcgcagag 
gaggactcgg ccgctgtcac cagcatgcaa 
gcctcgcgga gcgcggtggc ccagatcgcg 
tgcaccgaat gctgtctggc gcgtgacccc 
cgcttccagc ccagtgccaa gaggcggttc 
agcacgttgt gctccggaga ctcgtctcgt 
gtggagggca gcagcgcctt tctggagtgt 
tggactttcc agcgcgcagg ggtgacagcc 
gagcgcaccg cccggggact actgctgcgc 
ttgtgcgccg ccgtcgagca gggctttacg 
ttgagtgcta cgcaggccga acgactggcg 
ccgggcccca aactctggta ccgggacttt 
agcgcgaact ccctgcgcat gtgccgcccg 
tcgcggagaa agggccgtaa ccggaggacc 
ccgcgcagcg caacgcactg gtgaccagac 
gacaggcgag agaggagcca gacagaccct 
ttgggggtca ccggccgatg gagacaccaa 
gggcttattt attaacagga taacccttga 
cgggcgcagg attcagccgg agggaaggga 
gggccgagga ggtgcctgga gtgcccaccc 
gagcagtgag cagaaagctg tgaacaggct 
actgtactaa agtaacgcaa taaacgcatt 
aaaccccaaa aa 

<210> 38 

<211> 2756 

<212> DNA 

<213> Homo sapiens 

<220> 



ccgctgaacc ctgagcaccc tgagctgctg 60 
tcccgggcct ggccctgctc tgggcagtgg 120 
gccttcggct ctccttccaa gagctccagg 180 
agcgaacctg ctgctaccag gccttgctgg 240 
gtgccgagaa ccatgtggcc tccctcaacc 300 
tggcctggcc ggcccctgtg gaatggcgag 360 
gtactgagtg catgaacttc gtgaagttgc 420 
cctgtggcac gggagccttc cacccaacct 480 
aggagcccgt cctccggctg gacccaggaa 540 
atgaccccag gcatcgggct gcctccgtgc 600 
cagcagacct catgggacga gactttacca 660 
tccgaacaga gccacacgac tcccgctggc 720 
ggatcccgga gagcgagaac ccagacgacg 7 80 
cggtagaggc ggcgccggca ctgggacgcc 840 
ggaacgacgt gggcggccag cgcagcctgg 900 
ggctggtgtg ctcggtgccc ggcgtcgagg 960 
gcgggagtgg gtatggggtt ggggaggggg 1020 
gcagcggggc tgtgcgccct accccagcta 1080 
ttgtcctcgc gggaccaccg gaccccgctg 1140 
atcttccagg gctctgcggt gtgcgtgtac 1200 
ggaccctttg cacacaagga ggggcccatg 1260 
ccctacccgc ggccaggcat gtgccccagc 1320 ' 
gacttcccag acgatgtcat ccagtttgcg 1380 
ctgcccactg gggggcgccc tcttttccta 1440 
attgccgcgg accgggttgc agccgctgac 1500 
gacgttggca cggtgctgaa ggtgatctcg 1560 
gggctgctcc tggaggagct gcacgtgttt 1620 
atttcttcca agaggcacca gctgtacgta 1680 
ttgcaccgct gcgctgccca cggccgcgtc 1740 
tactgcgcct gggacggggt cgcgtgcacg 1800 
cggcggcaag acgtaaggaa tggcgacccc 1860 
cccgcgctgc tggaacacaa ggtgttcggc 1920 
gagccccgct cgctgcaggc gcgcgtggag 1980 
cacacccagg tgctggcaga ggagcgcacc 2040 
aggctgcggc gccgggactc gggcgtgtac 2100 
caaccgctgc gtcgcctgtc gctgcacgtg 2160 
cgggccgagg aggctgcgcc cgccgcgccg 2220 
ctgcagctgg tggagccggg cggaggtggc 2280 
cagcctgcgc tgcagtcact gcccctggag 2340. 
cacgcccctg agcctcgcgc tgagcggggg 2400 
tgtccccacg ccgggaacca agcaggagac 2460 
gaaaagaagg acgggttggg gccgggcaca 2520 
ccgacaggcc ctggctgagg gcagctgcgc 2580 
atgtagcagc cccgggaggg cggcacaggt 2640 
cggggaagcc gagctccaga gcaacgacca 2700 
tgggagacag accccacctc cttgggtagt 2760 
gggctgctgg aggtggggcg aggcaggccg 2820 
atcagccaaa gctggaatgg ccccagcaga 2880 

2892 
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<221> misc_feature 

<223> Incyte ID No: 7511618CB1 

<400> 38 

caggcgcgtt acccggcgtg ctccgcgcgg 
gggaggagga agcggtggct gctgcggatg 
ggcggctgcc gagcgcctga cccgggcctg 
cacacccgct acggtggccc tgcgcccgtt 
gctgtccagg cctgccggct ctggtgtctg 
tgcttgcctg agtcaccgcc gccgccgccg 
gcggtcctcc gggctcccag gatagcgccg 
ccgctgcctc cgcggagccc aaaatcatga 
aggaattcgc cgtgcccgag aatagctccg 
cagcaatgat gcaggagatg atgaggaacc 
tcccaggggg atataatgct ttaaggcgca 
gtgctgcaca agagcagttt ggtggtaatc 
ctggtgaagg tagtcaacct tcccgtacag 
ctccacagac ttcccagagt tcatcagctt 
ctactggtag tactgccagt ggcacttctg 
ctggagtagg agctagtatg ttcaacacac 
ctgaaaaccc acaactgatg caaaacatgt 
agtcactaag ccagaatcct gaccttgctg 
ctggaaatcc tcagcttcaa gaacaaatga 
tgcagaatcc tgatacacta tcagcaatgt 
agattcagca gggtttacag acattagcaa 
ctcctggctt gggggcatta ggaagcactg 
ccacacctag tgaaaacaca agtcccacag 
ttattcagca gatgctgcag gctcttgctg 
tcagatttca gcaacaactg gaacaactca 
acttgcaagc tctaatagca acaggaggtg 
gctcccagcc atcatagcag catttctgta 
cggctcttaa actttaaaat acctgcttta 
ttataaacaa acccaatatg atgcatttta 
tctgtatttt tcttttctgg aacagtggga 
tttattgtaa ttttttaaaa acatcacctt 
atctgtccag tttatttgct ttttaaacat 
aagcattaaa aagaagcaaa tcatttgcac 
gtgactttgg catgcatttt tgcaaacaat 
ttttatttgt atacaatata gagtatgcac 
caataggctc tctgagcaaa acacctgtaa 
aaagctgagt atttcctaat tgtatagaat 
caaaagtgcc ggttagtcag gtttgttgaa 
tctttaagga caattaattg tacagacaca 
agattgactg taaattacct taatctttgt 
aaagtgcatt tgcctaggac ttctcagctt 
tttgtaattg aaatgttgct ttcactgaaa 
cgccataaaa aaatagaact atcttttggg 
acaaatttaa aatgagttgt gagccaattg 
aaatagttaa atattgtgct tttcagagcc 
gggcagcgga atctgtcctg gatggggcca 

<210> 39 

<211> 873 

<212> DNA 

<213> Homo sapiens 



cgccggcgaa gggacgtggg ggaaggggca 60 
tcggtgtgag cgagcggcgc ctgaacacac 120 
cgccagagcc tgcaccgagc tccggggccc 180 
gctactgagg cggcgtgctc tgcattcttc 240 
ctggctcctc cttgctcgcc tgctccctcc 300 
ccacagccat ggccgagagt ggtgaaagcg 360 
ccggagccga aggtgctggc gcccccgcgg 420 
aagtcaccgt gaagaccccg aaggaaaagg 480 
tccagcagac gttggaactt gccaggaatc 540 
aggaccgagc tttgagcaac ctagaaagca 600 
tgtacacaga tattcaggaa ccaatgctga 660 
catttgcttc cttggtgagc aatacatcct 720 
aaaatagaga tccactaccc aatccatggg 780 
ccagcggcac tgccagcact gtgggtggca 840 
ggcagagtac tactgcgcca aatttggtgc 900 
caggaatgca gagcttgttg caacaaataa 960 
tgtctgcccc ctacatgaga agcatgatgc 1020 
cacagatgat gctgaataat cccctatttg 1080 
gacaacagct cccaactttc ctccaacaaa 1140 
caaaccctag agcaatgcag gccttgttac 1200 
cggaagcccc gggcctcatc ccagggttta 1260 
gaggctcttc gggaactaat ggatctaacg 1320 
caggaaccac tgaacctgga catcagcagt 1380 
gagtaaatcc tcagctacag aatccagaag 1440 
gtgcaatggg atttttgaac cgtgaagcaa 1500 
atatcaatgc agctattgaa aggttactgg 1560 
tcttgaaaaa atgtaattta tttttgataa 1620 
tttcattttg actcttggaa ttctgtgctg 1680 
aggtggagta cagtaagatg tgtgggtttt 1740 
attaaggcta ctgcatgcat cacttctgca 1800 
ttatagttgg gtgaccagat tttgtcctgc 1860 
tagcctatgg tagtaattta tgtagaataa 1920 
tctataattt gtggtacagt attgcttatt 1980 
gctgtaagat ttatactact gataattttg 2040 
atttgggact gcatttctgg aaacatactg 2100 
ctaaaaaagt gaagataaga aaatactctt 2160 
cttacagcat ctttgacaaa catctcccag 2220 
aatacagtag aaaagctgat tctggttatc 2280 
taatgtaaca ttgtctcaac attcattcac 2340 
gcagactgaa ggaacactgt agtatacccc 2400 
ctcccatagg tagtttaaca ggcattaaaa 2460 
aagtgtcttg atgtttcagt tatttttaat 2520 
tttatctgtt ttctcatgca caggcaatac 2580 
tttctgaagt gttttggtag ttctattaag 2640 
tcagagaaag ggggacgggg tgggggggtg 2700 
gcttaaataa tactggcaac caagat 2756 
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<220> 

<221> mis cofeature 

<223> Incyte ID No: 6244135CB1 

<400> 39 

tgggtggctg ggtggggaca cctgcaagtc ctccctaagc cagctaccac cctacactgt 60 
tggcctccct tctccaactg tggggatgct gctcaggcct tttgtgacat cacacctgag 120 
agtccctggg gtccagtcat tgctgctggg cacagcgagg tccaagctca ggtcgccctg 180 
ccccctaccc accatgccag atccagcatc gttgtgggca aacaattatc tggatgatct 240 
ttatggggct taagcttggg tgggagcaga tggggcatga gctggggatt tggggatggg 300 
gggaatccac acccccacgt cctggacgtt taaaaggccc tctctggcac tgggccgggg 360 
cagaggccag cagaaaagtg actggagtcc agggacatga tggatcagga ggagaagacg 420 
gaggaaggct caggcccctg tgccgaggcg ggctccccag accaggaggg cttcttcaat 480 
ctgctgagcc acgtgcaggg cgaccggatg gagggacagc gctgttcact gcaagccggg 540 

. ccgggccaga ccaccaagag ccagagcgac cccacccccg agatggacag cctcatggac 600 
atgctggcca gtacccaggg ccgccgcatg gatgaccaac gtgtgacagt cagcagcctg 660 
cccggcttcc agcccgtggg gtccaaggac ggagcacaga aacgagctgg gaccctcagt 720 
ccccaacccc tgctcacccc tcaggacccg accgctctcg gcttccgtcg gaacagcagc 780 

» ccccagcccc cgacacaagc cccctgaggg cctgaggcat cctgggtctc actcggcccc 840 
caaaaactga taaaagaata aaacacttaa atg 873 

<210> 40 

<211> 2660 

*<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7506689CB1 

<400> 40 

tgagcagagg ccgtagcttg cgccgcaccc gcggccaggc ggagccaaag atgccggcct 60 
ctgctgcgcg gccccgcccg ggtcccgggc agcctacagc ctcgcccttc ccgctactgc 120 
tgctggcggt gctgagcggc ccggtatccg gccgcgtccc ccgctcggtg cccagaacct 180 
cgcttccaat ctctgaggct gactcctgtc tcacccggtt cgcagtccct cacacataca 240 
attactctgt tctccttgtg gatcctgcct cccacacact ttatgttggc gcccgggaca 300 
ccatcttcgc tttatccctg cccttctcag gggagagacc ccgcaggatt gactggatgg 360 
ttcctgaggc tcacagacag aactgtagga agaaaggcaa gaaagagggg gacctcgggg 420 
gccggaagac cctccagcag agatggacga cgtttttgaa agctgacctg ctctgtccag 480 
gcctgagcat ggccgggcct ccagtgtcct gcaggatgtt gctgtgcttc gacctgagct 540 
tggggcaggg actcccatct tttatggcat cttttcttcc cagtgggagg gggctactat 600 
ctctgctgtc tgtgccttcc gaccacaaga cattcggaca gtgctgaatg gtcccttcag 660 
agaactaaaa catgactgca acagaggact gcctgtcgtg gacaatgatg tgccccagcc 720 
cagacctgga gagtgcatca ccaacaacat gaagctccgg cactttggct catctctctc 780 
cctgcctgac cgcgtactca ccttcatccg ggaccaccca ctcatggaca ggccagtgtt 840 
tccagctgat ggccaccccc tgctggtcac tacagataca gcctatctca gagtcgtggc 900 
ccacagggtg accagcctct cagggaaaga gtatgatgtg ctctacctgg ggacagagga 960 
tggacacctc caccgagcag tgcggatcgg agctcagctc agcgttcttg aagatctggc 1020 
cttattccca gagccacagc cagttgagaa catgaaattg taccacagct ggctcctggt 1080 
tggctcccgt actgaggtga cacaagtgaa tacaaccaac tgtggccgtc tccagagctg 1140 
ctcagagtgc atcctggccc aggacccagt ctgtgcctgg agcttccggc tggatgagtg 1200 
tgtggcccat gccggggagc accgagggtt ggtccaagac atagagtcag cagatgtctc 1260 
ctctttgtgt cctaaagagc ctggagaacg tccagtagtg tttgaagttc ccgtggctac 1320 
agctgcgcat gtggtcttgc catgttctcc aagctcagca tgggcatcct gtgtgtggca 1380 
ccagcccagt ggagtgactg cactcacccc ccggcgggat ggactggagg tggtggtgac 1440 
cccaggggcc atgggcgctt atgcctgtga atgtcaggag ggtggggcag cccatgtggt 1500 
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agcagcttac agcttggtat ggggcagcca 
gggggcggga ctggctggct tcttcttggg 
gattggtcgg cgtcagcagc gacggcgaca 
cctggacctg ggggctccac cttctgggac 
ctctcctgaa gatgagcggt tgccgctggc 
attctcacca cccttcctgc ttgatccttg 
ggctcctcta gccacatgtg atgaaacatc 
tataagtgat cactggaacg gagtgaccac 
accatcccag cctctgagtt ctctttgagt 
tctctctgag cctggatggg cttggggcca 
aaatcagaac' tgctttctgc agcaaatcag 
accttcatcc ctggccccct atcttgggcc 
agctatgact ttgctttctg gttggagcct 
ggggcaaatg tgccctgagt ccttggggtg 
ctgtggggag tgcatgatcc ccatgttgca 
atctcagttt tccttccatg aaagagtacg 
gccacatgtg catgaatgac tgggccgatg 
agacaattgg aatgggggcg ttgactgcag 
ggcgtccttt ctaaacggaa tgaaacactg 
tttgctacca aaactctaaa 



gcgagatgct ccgagccggg cccacacagt 1560 
gattctcgca gcatccctga ctctcattct 1620 
gagggaactt ctggctagag acaaggtggg 1680 
cacaagctac agccaagacc ctccctcccc 1740 
cctggccaag aggggcagtg gctttggtgg 1800 
cccaagccca gcccacattc ggctaactgg 1860 
catctagagc tgggcaaatg accactagtg 1920 
tgagatgctg ggggtcactg ggcctggaag 1980 
atgagtgatt acttggattt tagtatctgt 2040 
gacctttgcc tgattcctga ttcccatgag 2100 
ggcttccccc taacatctga actcctgtaa 2160 
cattagtttt ggggatgggg cacagggcat 2220 
ggccggaagg aagagccctg gaggtggttg 2280 
gttctgctta ttcttcaagt ttatctgaat 2340 
atatggagtc tctgccctga gatcttcccc 2400 
tgtaaataca tagtgttcat aagagacctg 2460 
cttaggaata tctctgaggg atggggaggg 2520 
aactttagac cctgtagcca tagacggcat 2580 
ccctctcccc atgcccacaa cttactctac 2640 

2660 



<210> 41 
<211> 2761 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7510185CB1 



<400> 41 

atggctgtgc agcgcgccgc gtctccgcgc 
ctgctgccgc tgctgttgct gctgctgccc 
gaactggcat ttgctgtgga gccaagtgat 
ctggactgca gggtggaggg gacccctcca 
gagctgccag agagtaccca ctccaccttg 
ttcaggctgg agccgggagg cagcccttcg 
aaccgctttg ggctggtggt cagccggaag 
ttccacgtgc atccccaggc caccgtgggt 
caaatccatg ggcttcccaa acccctgatc 
acggacaatg agaggtacac attgctgccc 
gctgaggacg gtggcatctt ccactgtgtg 
cacggggcca ggctcactgt gtcaggctcg 
ctcgtggggc ctgagaacct caccctgaca 
gccacgggca acccgcgccc cattgtgtcc 
gtggagggca tccaggtgct gggcacagga 
cactctggcg tctacgtctg tgcagccaac 
cagggccggc tggtggtgca agccccagct 
aggccagctg ggaccacagc catgttcacc 
gtcacgtggc tgaaaaatgg acaggtgctg 
aacaacagca cactgaccat ttctggaatc 
gtggccgaga acagtgcggg ctcatcacag 
gaggggctcc ccgggcctcc ccgcaatgtg 
cgtgtgtcct ggagtgagcc gctggccaac 
atcaggaagg ctgctgaccc accggagctg 
tttcagcacc tggtcagcga cctggagccc 



cgcccgcccg ccccgctctg gccccggctc 60 
gcgccgagcg agggtcttgg ccactctgcc 120 
gatgttgccg tccccgggca gcctatagtg 180 
gtgcgaatca cctggaggaa gaatggggta 240 
ctggccaatg ggtccttgat gatccgtcac 300 
gatgaaggtg actatgagtg tgtggcccag 360 
gctcgcatcc aagctgcaac catgtcggac 420 
gaggagggtg gtgtggcccg cttccagtgc 480 
acttgggaga agaacagagt cccaattgac 540 
aagggggtcc tgcagatcac aggacttcga 600 
gcctcaaaca tcgccagtat ccggatcagc 660 
ggctctgggg cctacaagga gccagccatc 720 
gtgcaccaga ccgcggtgct tgagtgtgtc 780 
tggagccgcc tggatggtcg ccctatcggg 840 
aacctcatca tctcagacgt gacggtccag 900 
agacctggca cccgggtgag gagaacggca 960 
gagtttgtgc agcatcccca gtccatctcc 1020 
tgccaagccc agggtgagcc accgcctcat 1080 
gggccaggag gccacgtcag gctcaagaat 1140 
ggtcctgagg atgaagccat ttatcagtgt 1200 
gccagtgcca ggctgaccgt actgtgggct 1260 
cgggcagtct ctgtgtcttc cactgaggtg 1320 
accaaggaga tcatcggcta cgtcctgcac 1380 
gagtatcagg aggcagtcag caagagcacc 1440 
tccacagcct acagtttcta catcaaggcc 1500 
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tacacaccaa ggggggccag ctcagcctct 
gcccctgccc cacccccact gtcagtgcga 
tgggagcctt ggccccggct ggcccagcac 
gcaagcaaga cctccttcac cggccccatc 
ctcagccagc tcgaccccac tgcagtgtat 
ggagatggca atgccacagt ccgctttgtg 
ttgagcccac catgtgactg ccggaaggag 
atcgtcatcg gcatccacat cggggtcact 
ttcggccaaa ggggcagggt cctcctgtgt 
cagggtcccc ggagccagag ggaccctggc 
cagcggggcc agctgggccg agacgagaaa 
ttccccccgg ccagcgcagc agggcagccg 
gctccgtgtg aggagaccca gctctccttg 
gggaagacga cggaggcgaa gaccacagag 
gccccaccac ccccagatgg aggccctggc 
gcagcggccc gggttaccca gccagctcac 
cagagggtgg acggagcggg gcccattctc 
gggatttgga tggtcctggg ggctccccag 
caaaacccaa ggcagccttg acagggaccc 
cagttcctgc aggagcctgt tccttccctg 
ccacatgact tggaactgaa ctaacafcttt 
a 



gtgcccaccc tagctagcac cctgggtgaa 1560 
gtcctgggca gctcctcctt gcagctgctg 1620 
gagggcggct tcaagctgtt ttaccgccca 1680 
ctgctgcctg gaaccgtctc ctcctacaac 1740 
gaggtgaagc tgctcgccta caaccagcat 1800 
tctttgaggg gagcatctga gaggacagcc 1860 
gaggccgcca accagacgtc caccacaggc 1920 
tgcatcatct tctgtgtcct cttcctcctg 1980 
aaagatgtgg aaaaccagct gtcccctcca 2040 
attctggccc taaatggggc gagacgggga 2100 
cgtgtggata tgaaggagct ggagcagctg 2160 
gaccccagac ccacagatcc tgcagccccc 2220 
ctgccacttc aggggtgcgg cctgatggag 2280 
gccacggctc cctgcgccgg cctggcggct 2340 
ctcctcagtg aaggccaggc ttccaggcct 2400 
tcggaacagt agccagtgtc tggcaggctc 2460 
aggtcaaaag caagatttct actgtcatgt 2520 
tatttctatc ctgactgcct cttgggttgt 2580 
cccggcccta acacccatca ggagttggag 2640 
ggctgacgcc cccttgcctc tgcctggtac 2700 
tctttaaaaa gcaaaactta aaaaaaaaaa 2760 

2761 



<210> 42 

<211> 1440 

<212> DNA 

<213> Homo sapiens 



<220> 

<22i> misc_feature 

<223> Incyte ID No: 1420867CB1 



<400> 42 

agcagctcaa ccccatcttt ggagagatcc 
cggagctgac ggtcgccata tttgatcatg 
agacccacat tgatctggaa aaccgattct 
cctcccagta tgaaagatgg ttacaatgcc 
ctggcggggc tgtgccaacg ctgtggcctc 
aaggtgggca gcaaagtctt cctgacacca 
catgtggatt gacatctttc ctcaagatgt 
tcggcagcca atcagctatg agctcagagt 
ggatgacgag aatccactca ccggagagat 
gaaggggttg gagcatgaca agcaggagac 
ggggaacttc aattggcgct ttgtgttccg 
gagcgtccgg cgcaggtctg gaccctttgc 
agtgctggtc ctgcaggatc cctggagttg 
ggccccgagc tctgctctgt gcagctggcc 
tttcgctgcc gccgcctgag gggctggtgg 
gtggagcggg aggcgcagga ggctcaggct 
aagggccggc cagaagacct ggagttcaca 
ggcaaggtgg aggcagagtt tgagctgctg 
gggaaggggc ggaagcagcc agagcctctg 
aactggtttg tgaacccgct gaagaccttt 
accctggtgc tgctgctact ggtgctgctc 
atccctggcc agatcagcca ggtcatcttc 
ttggacactc acccagggtg ccaacccttc 



tggagctaag catctctctc ccagctgaga 60 
acctcgtggg ttctgacgac ctcatcgggg 120 
atagccacca cagagcaaac tgtgggctgg 180 
tggcgtgatg cattctggcc ttcgcagatc 240 
cctgcccctg aataccgagc cggtgctgtc 300 
ccggagaccc tgcccccagg gatctcttca 360 
gcctgctcca cccccagttg acatcaagcc 420 
tgtcatctgg aacacggagg atgtggttct 480 
gtcgagtgac atctatgtga agagctgggt 540 
agacgttcac ttcaactccc tgactgggga 600 
ctttgactac ctgcccacgg agcgggaggt 660 
cctggaggag gcggagttcc ggcagcctgc 720 
cagctaccag acatggtgca tggggcccgg 780 
cgcaatgggg ccgggccgag gtgcaatctg 840 
ccggtagtga agctgaagga ggcagaggac 900 
ggcaagaaga agcgaaagca gaggaggagg 960 
gacatgggtg gcaatgtgta catcctcacg 1020 
actgtggagg aggccgagaa acggccagtg 1080 
gagaaaccca gccgccccaa aacttccttc 1140 
gtcttcttca tctggcgccg gtactggcgc 1200 
accgtcttcc tcctcctggt cttctacacc 1260 
cgtcccctcc acaagtgact ctcgctgacc 1320 
aatgcctgct cctggaagtc tttcttaccc 1380 
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atgtgagcta ccccagagtc tagtgcttcc tctgaataaa cctatcacag ccactgaaaa 1440 

<210> 43 

<211> 1066 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 7512289CB1 

<400> 43 

agcagtctcc gccctcgcac 
ctctgccgcc gcggagcttc 
gccgtagagg ctgcaatcgc 
ccttcgaggg cgccccaggc 
agaaggaggc caagaaggac 
acgccgtcgc ggtggactgc 
cctggtgttg gtgcatgtgc 
gagcatactt gtacaaatat 
agtacatcaa agatgatgtc 
accagacaat tgaagaaaat 
tgcctgtccc agagtttgca 
ctggaaccta tttgcctcag 
ttgaaaacat tgatcacctg 
acaaactgca acgcagagaa 
tcgcaattcg gcattttgaa 
tcaagaaaaa cattattgag 
gtgcagtgat tattttttaa 
ttttcatctc attaattcaa 

<210> 44 
<211> 4179 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7512447CB1 

<400> 44 

ggaggcgctc ggggcatcga ggctgggagg cgggtccgcc ccctattgtg tagcggcgag 60 
agtggagccg agcggtgcgg agcagatctg gtggttctcc ggagagcagc ttcctcgggt 120 
gttacatgag ccaagccctc actgtacaga agagtgagag ctgaaacctg ttccctgagc 180 
tgatcagaag gacatccctt ggcccctcca tctgggctcc tgtggatagg aggggctggg 240 
tgagcaggcc agctgggcta tggtgtggtg cctcggcctg gccgtcctca gcctggtcat 300 
cagccagggg gctgacggtc gagggaagcc tgaggtggta tcggtggtgg gccgggctgg 360 
ggagagtgtg gtgctgggct gtgacctgct gcccccggcc ggccggcccc ccctgcatgt 420 
catcgagtgg ctgcgctttg gattcctgct tcccatcttc atccagttcg gcctctactc 480 
tccccgaatt gaccctgatt acgtgggacg agtccggctg cagaaggggg cctctctcca 540 
gattgagggt ctccgggtgg aagaccaggg ctggtacgag tgccgcgtgt tcttcctgga 600 
ccagcacatc cctgaagacg attttgctaa cggctcctgg gtgcgtctga cagtcaattc 660 
accccctcaa ttccaggaga cacctcctgc tgtgttggaa gtgcaggaac tggagcctgt 720 
gaccctgcgt tgtgtggccc gtggcagccc cctgcctcat gtgacgtgga agctccgagg 780 
aaaggacctt ggccagggcc agggccaggt gcaagtgcag aacgggacgc tgcggatccg 840 
ccgggtagag cgaggcagct ctggggtcta cacctgccaa gcctccagca ctgagggcag 900 



gggagctggg aggctgcgag 
ccgaacctct tcagccgccc 
agccgggagc ccgcagcccg 
cgcgccatgg tgaaggtgac 
gagcccaaga gcggcgagga 
aaggacccag atgatgtggt 
tttggactag catttatgct 
tttgcacttc aaccagatga 
atcttaaatg agccctctgc 
attaaaatct ttgaagaaga 
gatagtgatc ctgccaacat 
tcctatctga ttcatgagca 
ggtttcttta tttatcgact 
actattaaag gtattcagaa 
aacaaatttg ccgtggaaac 
gaaaattaat atcacagcat 
agtcttcttt catgtaagta 
ttaaaaccat taccttaaaa 



atccctaccg cagtagccgc 60 
ggagccgctc ccggagcccg 120 
cgccccgagc ccgccgccgc 180 
gttcaactcc gctctggccc 240 
ggcgctcatc atcccccccg 300 
accagttggc caaagaagag 360 
tgcaggtgtt attctaggag 420 
cgtgtactac tgtggaataa 480 
agatgcccca gctgctctct 540 
agaagttgaa tttatcagtg 600 
tgttcatgac tttaacaagg 660 
catggttatt actgatcgca 720 
gtgtcatgac aaggaaactt 780 
acgtgaagcc agcaattgtt 840 
tttaatttgt tcttgaacag 900 
aaccccaccc tttacatttt 960 
gcaaacaggg ctttactatc 1020 
ttttaa 1066 
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cgccacccac gccacccagc tgctagtgct aggaccccca gtcatcgtgg tgccccccaa 960 
gaacagcaca gtcaatgcct cccaggatgt ttcattggcc tgccatgctg aggcataccc 1020 
tgctaacctc acctacagct ggttccagga caacatcaat gtcttccaca ttagccgcct 1080 
gcagccccgg gtgcggatcc tggtggacgg gagcctgcgg ctgctggcca cccagcctga 1140 
tgatgccggc tgctacacct gtgtgcccag caatggcctc ctgcatccac cctcagcctc 1200 
tgcctacctc actgtgctct gcatgccggg ggtgatccgc tgcccggttc gtgccaaccc 1260 
cccactgctc tttgtcagct ggaccaagga tggaaaggcc ctgcagctgg acaagttccc 1320 
tggctggtcc cagggcacag aaggctcact gatcatcgcc ctggggaacg aggatgccct 1380 
gggagaatac tcctgcaccc cctacaacag tcttggtacc gccgggccct ctcctgtgac 1440 
ccgcgtgctg ctcaaggctc ccccagcttt tatagagcgg cccaaggaag aatatttcca 1500 
agaagtaggg cgggagctgc tcatcccctg ctccgcccaa ggggaccctc ctcctgttgt 1560 
ctcttggacc aaggtgggcc gggggctgca aggccaggcc caggtggaca gcaacagcag 1620 
cctcatcctg cgaccattga ccaaggaggc ccacgggcac tgggaatgca gtgccagcaa 1680 
tgctgtggcc cgagtggcca cctccacgaa cgtctacgtg ctgggcacta gccctcatgt 1740 
tgtcaccaat gtgtccgtgg tggctttgcc caagggtgcc aatgtctcct gggagcctgg 1800 
ctttgatggt ggttatctgc agagattcag tgtctggtac accccactgg ccaagcgtcc 1860 
tgaccgaatg caccatgact gggtgtcctt ggcagtgcct gtgggggctg ctcacctcct 1920 
agtgccaggg ctgcagcccc acacccagta ccagttcagc gtgctagctc agaacaagct 1980 
ggggagtggt cccttcagcg aaatcgtctt gtctgctccg gaagggcttc ctaccacgcc 2040 
agctgcaccc gggcttcccc caacagagat accgcctccc ctgtcccctc cgcggggtct 2100 
ggtggcagtg aggacacccc ggggggtact cctgcattgg gatcccccag agctggtccc 2160 
taagagactg gatggctacg tcttggaagg ccggcaaggc tcccagggct gggaggtgct 2220 
ggacccggct gtggcaggca cagaaacaga gctgctggtg ccaggcctca tcaaggatgt 2280 
tctctacgag ttccgcctcg tggccttcgc gggcagcttc gtcagcgacc ccagcaacac 2340 
ggccaacgtc tccacttccg gtctggaggt ctacccttcg cgcacgcagc tgccgggcct 2400 
cctgcctcag cccgtgctgg ccggcgtggt gggcggagtc tgctttctgg gagtggccgt 2460 
ccttgtgagc atcctggccg gctgcctcct gaaccggcgc agggctgccc gccgccgccg 2520 
caagcgcctc cgccaagatc cacctcttat cttctctccg accgggaagt cagctgcacc 2580 
ctctgctctg ggctcaggca gtcctgacag cgtggcgaag ctgaagctcc agggatcccc 2640 
agtccccagc ctgcgccaga gtctgctctg gggggatcct gccggaactc ccagccccca 2700 
cccggatcct ccatctagcc ggggaccctt acctctggag cccatttgcc ggggcccaga 2760 
cgggcgcttt gtgatggggc ccactgtggc ggccccccag gaaaggtcag gccgggagca 2820 
ggcagaacct cggactccag cccagcgtct ggcccggtcc tttgactgta gcagcagcag 2880 
ccccagtggg gcaccccagc ccctctgcat tgaagacatc agccctgtgg caccccctcc 2940 
agcagcccca cccagtccct tgccaggtcc tggacccctg ctccagtacc tgagcctgcc 3000 
cttcttccga gagatgaatg tggatgggga ctggcccccg cttgaggagc ccagccctgc 3060 
tgcaccccca gattacatgg atacccggcg ctgtcccacc tcatctttcc ttcgttctcc 3120 
agaaacccct cctgtatccc ccagggaatc acttcctggg gctgtggtag gggctggggc 3180 
cactgcagag cccccttaca cagccctggc tgactggaca ctgagggagc ggctgctgcc 3240 
aggccttctc cctgctgccc ctcgaggcag cctcaccagc cagagcagtg ggcgaggcag 3300 
cgcttcgttc ctgcggcccc cctccacagc cccctctgca ggaggcagct acctcagccc 3360 
tgctccagga gacaccagca gctgggccag tggccctgag agatggcccc gaagggagca 3420 
tgtggtgaca gtcagcaaga ggaggaacac atctgtggac gagaactatg agtgggactc 3480 
agaattccct ggggacatgg aattgctgga gactttgcac ctgggcttgg ccagctcccg 3540 
gctcagacct gaagctgagc cagagctagg tgtgaagact ccagaggagg gctgcctcct 3600 
gaacactgcc catgttactg gccctgaggc ccgctgtgct gcccttcggg aggaattcct 3660 
ggccttccgc cgccgccgag atgctactag ggctcggcta ccagcctatc gacagccagt 3720 
cccccacccc gaacaggcca ctctgctgtg aacatccctg atgtgaggct gtgaaaaggc 3780 
atatggacct gcaaaggagg cccccaacca gacagaccta gtttcaaacg agggcactgc 3840 
ccctgcctgc ccctttggtg cccaggcaca gaccctgata gtgggtttgg gtcaccttgg 3900 
tatggaatgt atgtgctgac cccctaggtg agtctgggga ttggaacagg gatcttaggt 3960 
ctgcctctct ctctctctct ctctctctct ctctctctgt gtgtgtgtgt gtgtgtgtga 4020 
agttttttac aggtgaataa cacaaagttt gaaagacaaa aaaaaaaagg ggggggccgc 4080 
gaatagtgag ctcgtcaccc cggaaataaa tcccggaccg gtcactgagg gtaaccagtc 4140 
ccctataggg agtccgtata agagtcgcgg aacacaggg 4179 
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transformed with said polynucleotides, promoter sequences, and compositions comprising same. 
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transformed with said polynucleotides, promoter sequences, and compositions comprising same. 
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ID NO: 9, an isolated polypeptide comprising SEQ ID NO: 9, an isolated polynucleotides comprising SEQ ID NO: 31 , cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 10 claim(s) 1-7, 9-10, 12-13, 17-18, 65, and 87 (each inpart), drawn to a method of producing apolypeptide comprising SEQ 
ID NO: 10, an isolated polypeptide comprising SEQ ID NO: 10, an isolated polynucleotides comprising SEQ ID NO: 32, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 11, claim(s) 1-7, 9-10, 12-13, 17-18, 66, and 88 (eachinpart), drawn to a method of producing apolypeptide comprising SEQ 
ID NO: 11, an isolated polypeptide conprising SEQ ID NO: 11, an isolated polynucleotides comprising SEQ ID NO: 33, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 12, daim(s) 1-7, 9-10, 12-13, 17-18, 67, and 89 (eachinpart), drawn to a method of producing apolypeptide comprising SEQ 
ID NO: 12, an isolated polypeptide comprising SEQ ID NO: 12, an isolated polynucleotides comprising SEQ ID NO: 34, cells 

transformed with said polynucleotides, promoter sequences, and compositions comprising same. 
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Group 13 claim(s) 1-7, 9-10, 12-13, 17-18, 68, and 90 (each in part), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 13, an isolated polypeptide comprising SEQ ID NO: 13, an isolated polynucleotides comprising SEQ ID NO: 35, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 14claim(s) 1-7, 9-10, 12-13, 17-18, 69, and 91 (each in part), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 14, an isolated polypeptide comprising SEQ ID NO: 14, an isolated polynucleotides comprising SEQ ID NO: 36, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 15 claim(s) 1-7, 9-10, 12~t3, 17-18, 70, and 92 (each in part), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 15, an isolated polypeptide comprising SEQ ID NO: 15, an isolated polynucleotides comprising SEQ ID NO: 37, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 16 claim(s) 1-7, 9-10, 12-13, 17-18, 71, and 93 (each in part), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 16, an isolated polypeptide comprising SEQ ID NO: 16, an isolated polynucleotides comprising SEQ ID NO: 38, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 17 claim(s) 1-7, 9-10, 12-13, 17-18, 72, and 94 (each in part), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 17, an isolated polypeptide comprising SEQ ID NO: 17, an isolated polynucleotides comprising SEQ ID NO: 39, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 18 claim(s) 1-7, 9-10, 12-13, 17-18, 73 and 95 (each inpart), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 18, an isolated polypeptide comprising SEQ ID NO: 18, an isolated polynucleotides comprising SEQ ID NO: 40, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 19claim(s) 1-7, 9-10, 12-13, 17-18, 74,and96 (each in part), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 19, an isolated polypeptide comprising SEQ ID NO: 19, an isolated polynucleotides comprising SEQ ID NO: 41, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 20 claim(s) 1-7, 9-10, 12-13, 17-18, 75 and 97 (each inpart), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 20, an isolated polypeptide comprising SEQ ID NO: 20, an isolated polynucleotides comprising SEQ ID NO: 42, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 21, claim(s) 1-7, 9-10, 12-13, 17-18, 76, and 98 (eachinpart), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 21, an isolated polypeptide comprising SEQ ID NO: 21, an isolated polynucleotides comprising SEQ ID NO: 43, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 22, daim(s) 1-7, 9-10, 12-13, 17-18, 77, and 99 (each in part), drawn to a method of producing a polypeptide comprising SEQ 
ID NO: 22, an isolated polypeptide comprising SEQ ID NO: 22 , an isolated polynucleotides comprising SEQ ID NO: 44, cells 
transformed with said polynucleotides, promoter sequences, and compositions comprising same. 

Group 23 claim(s) 8, drawn to a transgenic organism. 

Group 24 claim(s) 11, 31, 32, 34, and 36-43 drawn to a method of preparing an antibody, polyclonal and rnonoclonal antibodies, and 
con^ositions comprising same. 

Group 25 claim(s) 14-16, drawn to a method of detecting a target polynucleotides in a sample. 

Group 26 claim(s) 19 and 22, drawn to a method for treating a disease or condition associated with decreased expression of functional 
NTRN. 

Group 27 claim(s) 20 and 21, drawn to a method of screening for effectiveness as an agonist of a polypeptide and compositions 
comprising same. 

Group 28 claim(s) 23 and 24, drawn to drawn to a method of screening far effectiveness as an antagonist of a polypeptide and 
compositions comprising same. 

Group 29 claim(s) 25, drawn to drawn to a method far treating a disease or condition associated with overexpression of functional 
NTRN. 

Group 30 claim(s) 26, drawn to a method of screening for a compound that specifically binds to a polypeptide. 
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Group 31, claim(s) 27, drawn to a method of screening for a compound that modulates the activity of a polypeptide. 

Group 32, claim(s) 28, drawn to a method of screening a compound for effectiveness in altering expression of a target polynucleotide. 

Group 33 claim(s) 29, drawn to a method of assessing toxicity of a test compound 

Group 34 claim(s) 30 and 35, drawn to a method for a diagnostic test for a condition or disease associated with the expression of 
NTRAN in a biological sample. 

Group 35 claim(s) 44, drawn to a method of detecting a polypeptide. 
Group 36 claim(s) 45, drawn to a method of purifying a polypeptide. 
Group 37 claim(s) 46 and 48-55, drawn to amicroarray. 

Group 38 claim(s) 47, drawn to a method of generating an expression profile of a sample. 

The inventions listed as Groups 1-38 do not relate to a single general inventive concept under PCT Role 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical features for the following reasons: 

Group 1 recites the special technical feature of SEQ ID NO: 1 , which is not required by any of the other groups. 



Continuation of B. FIELDS SEARCHED Item 3: 
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human neurotransmission- associated protein, SNAP, SNARE, synaptotagmin, Rab, WD Repeat, vescile, neurotransmitter, 
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Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2)(a) for the foUowing reasons: 

1. \^\ Claim Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



CZI Claim Nos. : 

because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 



3. ^] Claim Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 

6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found multiple inventions in this rnternational application, as follows: 
Please See Continuation Sheet 



As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

2. I I As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 

payment of any additional fee. 

3. IX] As only some of the required additional search fees were timely paid by the applicant, this international search 

report covers only those claims for which fees were paid, specifically claims Nos.: 1-7, 9-10, 12-13, 17-18, 56, 69, 
78, and 91 (each in part) 



4. No required additional search fees were timely paid by the applicant. Consequently, this international search report 

is restricted to the invention first mentioned in the claims; it is covered by daims Nos. : 



Remark on Protest 



The additional search fees were accompanied by the applicant's protest. 
| No protest accompanied the payment of additional search fees. 
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